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This  grant  supported  research  to  develop  methods  for  parallel  programming  by  masses  of 
people  using  computers  on  networks.  Scientific  results  have  surpassed  initial  expectations. 

The  convergence  of  little-known  Russian  developments  in  program  transformation, 
supercompilation,  and  partial  evaluation  with  the  truly  American  phenomenon  of  Java  programming 
have  produced  surprisingly  strong  results.  This  part  of  the  pNet  (programming  parallel  Networks) 
project  is  the  pjava,  or  parallelizable  Java,  language.  Particularly  exciting  is  the  development  of 
program  transformation  techniques  for  machine  understanding  of  parallel  programs  written  m  the 
familiar  syntax  of  the  important  new  Web  programming  language  Java.  The  remainder  of  the 
research  centers  around  the  Norma  language,  developed  to  help  applied  mathematicians  create 
efficient  fiuid-fiow  codes.  Norma  helps  produce  parallel  programs  that  can  run  efficiently  on 
large  classes  of  parallel  and  distributed  computers,  including  the  Web. 

The  short  eight  months  of  research  supported  by  this  grant  have  produced  many  sound 
scientific  results.  This  work  points  the  way  to  a  grand  unification  of  techniques  in  program 
transformation,  parallelization,  and  compilation  that  will  allow  the  creation  of  libraries  of  reusable 
portable  parallel  codes  that  will  run  efficiently  on  almost  any  computer  system  to  be  found  in  or  on 
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The  Principal  Investigator  made  one  trip  to  Moscow  Russia  in  August  1996  and 
spent  many  hours  on  the  internet  in  connection  with  this  grant  to  develop  methods  for 
parallel  programming  by  masses  of  people  using  computers  and  groups  of  computers  on 
the  network.  Two  Russian  scientists  visited  New  York  for  several  weeks  of  collaboration 
and  familiarization  with  American  research  facilities.  These  trips  strengthened  the 
scientific  collaboration  that  had  developed  during  eight  prior  trips  to  Russia  by  the 
Investigator  to  find  Russian  computer  experts  with  technical  results  that  could  further 
world  science  if  they  were  allowed  to  continue  their  research  amid  the  gales  of  economic 
reform  in  Russia. 

Scientific  results  have  surpassed  initial  expectations.  The  Norma  language,  first 
developed  to  help  applied  mathematicians  create  efficient  fluid-flow  codes  offers  hope 
for  producing  parallel  programs  that  can  run  efficiently  on  large  classes  of  parallel  and 
distributed  computers,  including  the  World  Wide  Web.  Even  more  exciting  is  the 
development  of  program  transformation  techniques  for  machine  understanding  of 
parallel  programs  written  in  the  familiar  syntax  of  the  important  new  programming 
language  Java.  Supercompilation  techniques  for  very-high-level  improvements  of 
programs  can  work  for  Java  programs,  allowing  masses  of  users  to  combine  library 
routines  in  simple  ways  and  yet  produce  highly  efficient  parallel  codes. 

The  primary  goals  for  the  pNet  project  were:  (1)  To  explore  new  principles, 

languages  and  methods  for  creating  efficient  portable  parallel  programs  for  massively 
parallel  computers  and  networks  of  computers;  and  (2)  To  make  it  easy  for  technical 
application  experts  to  specify  elegant  solutions  to  well  understood,  but  possibly 
computationally  complex  information  processing  problems.  The  work  on  the  Norma 
language  has  addressed  these  goals. 

The  secondary  goals  were:  (1)  To  allow  the  creation  and  control  of  massively 
parallel(p)  programs,  run  on  possibly  millions  of  cheaply  available  computers  within 
huge  geographically  distributed  networks(Net);  (2)  To  simplify  the  creation  of  complex 
parallel  programs  using  compositions  of  pretested  program  modules;  (3)  To  make  recent 
Russian  developments  for  parallel  computations  available  to  scientists  in  western 
nations;  and  (4)  To  build  lasting  collegial  and  collaborative  ties  between  American  and 
Russian  scientists. 

The  ONR  sponsored  series  of  personal  scientific  interactions  between  American 
scientists  (including  this  Investigator)  and  computer  scientists  in  the  institutes  of  the 
Russian  Academy  of  Sciences  in  Moscow  and  Novosibirsk  have  led  to  an  unanticipated 
convergence  of  American  and  Russian  computing  cultures.  Combining  little-known 
Russian  developments  in  the  program  transformation  disciplines  of  supercompilation 
and  partial  evaluation  with  the  truly  American  phenomenon  of  Java  programming  have 
produced  surprisingly  strong  scientific  results.  The  Russians  taught  the  investigator 
about  program  improvements  based  on  computer  understanding  and  simplification  of 
human  produced  programs.  The  Investigator  suggested  that  the  methods  might  speed 
up  routinely  interpreted  Java  programs  without  sacrificing  the  Java  guarantees  of  safety 


from  rogue  programs.  In  1995,  most  Russian  scientists  had  not  then  heard  of  Java.  The 
result  is  a  major  part  of  the  pNet  project:  that  of  developing  the  pJava,  or  parallelizable 
Java,  language  and  program  creation  tools  based  on  it.  To  produce  these  results,  the 
Investigator  and  his  Russian  colleagues  have  had  to  share  the  chaos  and  privations  of 
the  Russian  economic  collapse,  bridge  long-instilled  cold- war  chasms  of  fear  and 
distrust,  and  overcome  an  awkward  language  barrier.  The  results  of  the  collaboration  at 
at  both  the  scientific  and  human  levels  have  been  worth  the  discomforts. 

Much  of  the  first  six  months  of  funding  for  this  project  were  spent  climbing  the 
mountains  of  paperwork  required  to  support  five  Russian  researchers  for  eight  months 
in  1996.  The  two-week  trip  to  Moscow  in  August  allowed  the  investigator  to  help  build 
modern  personal  computer  systems  for  research  on  Norma,  pJava,  and  supercompilation 
at  two  research  centers  in  Russia,  Keldysh  Institute  of  Applied  Mathematics  in  Moscow 
and  the  Program  Systems  Institute  in  Pereslavl-Zalessky.  Both  institutes  are  part  of  the 
Russian  Academy  of  Sciences.  More  importantly,  it  allowed  his  Russian  colleagues  to 
keep  their  hopes  alive  after  waiting  in  vain  four  months  to  be  paid  for  their  work  on  the 
project.  The  trip  also  permitted  the  investigator  to  acquire  identical  dual  English-Cyrillic 
keyboards  for  use  in  both  Russia  and  the  United  States  by  team  members.  During  their 
trip  to  the  New  York  in  November,  his  two  visitors  installed  operating  system  software 
compatible  with  that  on  their  machines  in  Russia.  This  seemingly  small  detail  of  having 
the  same  keyboard  and  compatible  personal  computer  software  has  made  possible  the 
efficient  production  of  joint  scientific  reports. 

The  bulk  of  the  scientific  developments  from  this  project  are  contained  in  the  six 
technical  reports  attached  to  this  final  report.  Together  they  cover  nearly  two  hundred 
pages  of  analyses  in  applied  mathematics  and  computer  science.  Three  of  the  reports 
are  newly  written  for  this  project.  Three  are  translations  of  Norma  design  documents 
previously  available  only  in  Russian.  The  rest  of  this  report  gives  the  abstracts  of  the 
six  technical  papers  in  the  Appendix. 


Program  Transformations  for  Java 

For  the  investigator,  the  most  exciting  work  is  given  in  the  paper  entitled,  “Program 
Transformations  for  Java”  by  Andrei  V.  Klimov  of  the  Keldysh  Institute  for  Applied 
Mathematics  in  Moscow,  Valentin  F.  Turchin  of  The  City  College  of  New  York,  and  Larry 
D.  Wittie  of  SUNY  at  Stony  Brook  in  New  York.  It  shows  how  supercompilation  can  be 
used  to  transform  parallel  and  distributed  programs  written  in  (p)Java  and  make  them 
much  more  efficient.  It  also  gives  clear  examples  of  how  a  Java  supercompiler  works  to 
understand  the  deep  meaning  of  (p)Java  programs  and  can  produce  simpler  equivalent 

programs. 

A  proeram  transformation  system  tor  Java  is  presented.  Two  program  understanding  and 
transformation  disciplines  are  reviewed;  superconipilaiion  and  partial  e\'aluation.  Supercompilation 
is  more  general  and  powerful:  partial  e\^aluation  is  a  subset,  but  simpler  to  understand  and  to  use. 

Transformation  methods  were  originally  developed  for  functional  languages.  Here,  for  the  first 
time,  is  a  demonstration  of  supercompilation  for  the  imperative  language  Java.  We  show  how 
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the  main  phases  of  a  supercompiler  work  for  Java:  configuration,  driving,  configuration  analysis. 

How  to  supercompile  Java  is  explained  via  a  Producer-Consumer  example,  including  a  trace  of 
manual  supercompilation  steps. 

The  strong  points  of  this  report  are: 

It  gives  an  excellent  review  of  program  transformation  methods; 

It  shows  that  the  syntax  of  the  popular  Java  programming  language  can  be  used  to  write  computer- 
understandable  portable  parallel  programs  that  automatically  can  be  deeply  analyzed,  optimized 
and  tailored  to  run  efficiently  for  many  specific  problems  or  computer  system  architectures;  and 
Its  major  scientific  contribution  is  a  demonstration  that  supercompilation  techniques  are  already 
powerful  enough  to  fuse  several  concurrent  threads  into  one,  dramatically  increasing  the  efficiency 
of  some  computer  programs. 

Supercompilers  can  perform  significant  inter-thread  analysis  and  source  code  optimization.  We 
explain  why  and  how  we  plan  to  develop  and  implement  a  prototype  supercompiler  for  Java. 

Effectively  a  supercompiler  can  deduce  the  essential  meaning  of  calls  to  many  code 
functions  and  incorporate  that  meaning  into  highly-restructured  faster-running  codes. 


pjava  -  A  Parallel  Superset  of  Java 


The  second  report  covers  the  planned  steps  in  the  development  of  an  implicitly 
parallel  version  of  the  Java  programming  language,  called  pJava.  The  paper  is  entitled, 
“pJava  -  A  Parallel  Superset  of  Java  for  Automatic  Parallelization”  and  was  written  by 
Andrei  V.  Klimov  of  the  Keldysh  Institute  for  Applied  Mathematics  in  Moscow  and  Larry 
D.  Wittie  of  SUNY  at  Stony  Brook  in  New  York. 

An  extension  of  Java,  called  pJava,  which  allows  for  automatic  parallelization  and  efficient  program 
transformation,  is  presented.  The  idea  of  pJava  is  to  select  a  Java  subset  with  suitable  properties, 
and  then  gradually  to  extend  it  while  preserving  the  properties.  The  starting  point  for  pJava  is  a 
purely  functional  subset  of  Java  with  data  limited  to  immutable  objects.  Higher-level  declarative 
notions  will  gradually  be  added  to  allow  for  easy,  reliable,  implicitly  parallel  programming. 
Ultimately  mutable  objects  will  be  allowed  as  well,  but  under  a  special  programming  discipline.  This 
discipline  can  be  satisfied  at  a  low-level  by  a  qualified  Java  programmer,  or  at  a  high-level  via 
constructs  based  on  monotone  objects,  which  can  be  easily  applied  by  less-experienced  users. 

The  pjava  language  will  allow  programs  to  be  written  that  can  be  ported  to  new 
computer  systems  and  new  application  areas,  and  still  be  analyzed  and  improved 
automatically  by  supercompilation  system  algorithms.  More  rapid  execution  can  result 
both  from  program  transformations  to  produce  faster,  simpler,  guaranteed-equivalent 
source  codes  and  from  automatic  parallelization  to  use  many  computers  at  once  on  parts 
of  the  same  code. 
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Creation  of  Efficient  and  Portable  Parallel  Programs 


The  third  report  is  entitled,  “Creation  of  Efficient  and  Portable  Parallel  Programs”  and 
was  written  under  the  direction  of  Igor  B.  Zadykhailo  of  the  Keldysh  Institute  for 
Applied  Mathematics  in  Moscow.  It  gives  a  detailed  semantic  definition  for  the  Norma 
parallel  programming  language  for  applied  mathematics.  This  precise  definition  is 
needed  to  build  the  program-analysis  systems  that  underlie  both  supercompilation 
transformations  and  automatic  parallelization  of  mathematical  solution  algorithms  for 
problems  in  applied  mathematics. 

The  first  publications  appeared  in  late  50s  -  early  60s.  The  main  idea  of  the  language  later  named  NORMA 
is  very  simple.  It  was  an  attempt  to  automate  the  design  of  the  programs  based  on  the  jobs  prepared  by 
applied  mathematicians  from  Keldysh  Institute  of  Applied  Mathematics  for  further  programming.  Usually 
those  jobs  were  the  result  of  applying  numerical  methods  (more  often  grid  method)  to  physical  problems’ 
solution.  The  intention  was  to  create  the  language  of  jobs’  specification  corresponding  to  the  constructions 
obtained  after  mathematical  solution  of  the  problem. 

We  have  gained  much  experience  in  designing  complicated  program  systems  and  translators,  now  it  is  time 
to  create  really  friendly  programming  languages.  We  shall  take  a  decisive  step  and  turn'  from  universal 
languages  to  the  languages  for  users  to  formulate  problem  solution  in  generic  terms.  We  are  sure  that 
universal  languages  may  be  friendly  only  to  system  programmer.  Hence  we  shall  bend  every  effort  to 
creation  of  specialized  language  for  each  application  domain. 

As  the  specification  in  the  NORMA  language  contains  full  specification  of  an  algorithm  and  doesn’t  reflect 
any  peculiarities  of  the  computer.  It  may  be  implemented  on  the  computer  with  any  architecture  both 
sequential  or  parallel.  Synthesising  translator  must  be  automatically  adapted  to  the  peculiarities  of  the 

archiiecture  or  must  allow  for  the  peculiar  architecture. 

As  the  user  is  free  from  the  necessity  of  making  a  program  (this  part  is  carried  out  by  a  translator)  then 
such  process  of  programming  doesn’t  cause  mistakes  (up  to  the  reliability  of  a  translator).  Further  more 
besides  the  traditional  syntactical  and  semantic  diagnostics  the  synthesising  translator  can  send  mess^es 
about  the  errors  in  the  essential  notions.  E.g.,  impossibility  of  organising  computation  caused  by 
insufficient  initial  data,  by  the  mistakes  in  the  index  displacements  etc. 

High  efficiency  of  automatically  designed  program  is  based  on  the  capability  of  deep  parallelising  and 
providing  the  necessary  level  of  parallelism  granularity.  Generally  speaking  the  synthesising  translator  has 
the  capabilitv’  of  estimating  the  different  variants  of  the  representation  of  the  declarative  specification  in  the 
program  and  finding  of  the  best  one  according  to  the  built-in  rule  or  in  the  dialogue  with  the  system 
programmer  or  the  user  himself 

Formal  semantics'  specification  of  the  NORMA  language  versions  1-22  are  given  in  this  part.  Semantics’ 
specification  is  based  on  the  application  of  operational  approach  and  fomial  methods  of  relational  algebra. 
References  to  syntactical  notations  introduced  in  specification  of  the  NORMA  language  by  extended 
notation  of  Backus-Naur  is  used  here. 

A  Norma  to  pJava  translation  system  is  among  the  first  software  products  planned 
for  this  part  of  the  pNet  project.  Not  only  do  program  analysis  routines  for  Norma  have 
much  in  common  with  those  for  pJava,  but  technical  developments  for  pJava  directly 
will  make  faster  running  Norma  target  codes  for  a  wide  range  of  computer  architectures. 
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Translations  of  Recent  Norma  Papers  From  Russian 


The  last  three  reports  are  translations  from  Russian  to  English  of  recent  papers  in  the 
development  of  the  Norma  programming  language  and  its  underlying  program-analysis 
and  -compilation  systems.  There  are  also  four  older  Norma  papers  still  only  in  Russian. 

NORMA  -  Language  specification  -  Draft  copy  1.22 

This  is  the  latest  version  of  the  formal  specifications  of  Norma  language  syntax,  entitled: 
"NORMA  -  Language  specification  -  Draft  copy  1.22”.  It  is  a  1996  revision  of  the  earlier 
50  page  preprint  numberl20  “The  specification  of  the  NORMA  language.  Draft  Standard,” 
published  at  the  Keldysh  Institute  of  Applied  Mathematics  in  Moscow  in  1995  by  the 
same  authors:  Alexander  Nikolaevich  Andrianov,  Alexander  Borisovich  Bugerya,  Kyrill 
Nikolaevich  Efimkin,  and  Igor  Borisovich  Zadykhailo.  It  gives  the  form  of  all  statements 
in  the  Norma  language. 

The  NORMA  language  is  a  tool  aimed  at  automatic  solution  of  the  mathematical  physics  problems  on 
parallel  computer  systems.  The  aim  of  the  NORMA  language  is  to  eliminate  the  programming  phase  which 
is  necessary  to  pass  from  computational  formulae  derived  by  an  application  specialist  to  a  computer 
program.  There  is  no  essential  difference  between  computational  formulae  and  NORMA  program 
structures  -  these  formulae  are  an  input  for  the  NORMA  translating  system.  In  fact  NORMA  program  is  a 
nonprocedural  specification  of  problems  to  be  solved.  The  mathematical  problems  connected  with  the 
synthesis  of  output  program  are  solvable  in  the  case  of  the  NORMA  language.  Draft  specification  of  the 
NORMA  language  is  given. 


Organization  of  loop  computations  in  NORMA  language 

This  is  an  slightly  older  paper  by  Alexander  N.  (Sasha)  Andrianov  about  the  compilation 
methods  to  use  in  determining  the  execution  order  of  operations  in  Norma  loops: 
“Organization  of  Loop  Computations  in  the  NORMA  Language.”  It  was  preprint  number 
171  printed  in  Moscow  in  1986  by  the  Keldysh  Institute  of  Applied  Mathematics.  The 
last  of  the  three  papers  is  a  continuation  of  this  report,  which  was  26  pages  in  Russian. 

The  problems  of  loop  process  organization  for  the  program  written  in  nonprocedural  language  NORMA 
are  considered  in  this  paper.  An  algorithm  of  designing  the  system  of  simple  loops  allowing  parallel 
processing  is  given.  The  algorithm  is  based  on  the  notion  of  computation's  front  which  is  a  hyperplane 
where  \ariables‘  \alues  may  be  computed  in  ever}'  its  point.  The  task  of  Linear  Integer  Programming  is 
solved  for  determination  of  the  hyperplane's  parameters. 

Key  words:  nonprocedural  language,  synthesis  of  the  program,  parallel  computations. 
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Organization  of  loop  process  on  specification 


This  paper  “Organization  of  loop  process  on  nonprocedural  specification”  by  A.N. 
Andrianov  and  E.A.  Andrianova  is  a  continuation  of  the  1986  on  organization  of  loop 
operations  in  Norma,  number  2  in  this  list  of  translated  papers.  It  was  completed  in 
Moscow  in  1996. 

NORMA  is  a  programming  language  [1]  aimed  at  automation  of  mathematical  physics  problems  solutions 
on  parallel  computer  systems. 

The  NORMA  language  allows  elimination  of  a  programming  phase  in  transition  from  formulae  specified 
by  a  technical  expert  to  a  program  itself.  There  is  no  much  difference  between  formulae  and  NORMA 
specifications.  In  fact  these  formulae  are  input  data  for  a  translator. 

Synthesis  of  output  program  is  carried  out  automatically  during  the  translation  from  NORMA.  The  order 
and  the  way  of  performing  calculations  (parallel,  vector  or  sequential)  is  determined  automatically.  The 
order  of  the  language’s  sentences  is  arbitrary  ( information  dependencies  are  revealed  and  taken  into  account 
during  the  organization  of  computing  process  ).  There  are  no  such  programming  terms  as  memory,  loop, 
control  operators  in  the  language.  Output  program  is  generated  with  the  architecture  of  a  target  computer  as 
a  guide. 

In  fact  the  program  in  NORMA  is  a  nonprocedural  specification  of  the  problem  to  be  solved.  The 
synthesis  of  output  program  raises  some  mathematical  problems  but  they  are  solvable  in  the  case  of 
NORMA  language. 

Some  Norma  peculiarities  makes  the  process  of  automatic  object  program  design  available  for  practice 
realization.  They  are: 

1 .  index  expressions  of  caicuiated  variables  has  the  form  /  ^  c  where  i  -  index  name,  c  -  integer  constant. 

2.  NORMA  is  a  language  with  single  assignment.  Any  value  can  be  assigned  to  a  variable  only  once  ( 
only  once  to  each  point  of  domain  -  to  the  variables  defined  on  domain  ).The  first  constraint  defines  the 
class  of  formulae  which  can  be  used  for  the  problem’s  solution.  It  isn’t  strict  in  practice  as  the  index 
expressions  of  other  type  are  very  rare. 

Memory  allocation  and  the  problems  of  its  economy  caused  the  second  constraint.  These  problems  can  be 
solved  at  the  translation  stage.  The  second  constraint  simplifies  the  problem  of  output  program  synthesis. 

The  problem  of  output  program  is  to  be  solved  during  the  translation.  Solving  this  problem  is  based  on  the 
analysis  of  the  graph  of  information  dependencies.  The  Most  Strongly  Connected  subGraphs  (MSCG)  are 
chosen  from  the  graph.  In  general  case  organization  of  computations  for  the  nodes  requires  use  of  special 
methods. 

Assume  is  a  principal  operator  in  NORM.\.  This  operator  sets  the  relations  betw-een  variables  being 
calculated  on  a  domain.  Researching  on  the  subject  of  computational  process  organization  has  been  doing 
for  a  long  time.  The  purpose  of  this  paper  is.  to  specify  the  method  of  designing  loop  operators  which 
realizes  the  relations  included  in  MSCG  (Most  Strongly  Connected  subGraphs). 

What  we  are  discussing  is  an  amalgamation  of  program  understanding  and 
transformation  technologies  to  let  computers,  without  help  from  ultra-skilled  humans, 
write  efficient  working  programs  in  the  future  world.  Our  computers  are  becoming 
powerful  enough  to  understand,  to  manipulate  and  to  improve  their  own  programs,  at 
least  for  most  mundane  applications.  They  are  becoming  complex  enough  that  soon  only 
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they  will  be  able  to  write  efficient  codes  for  the  many  variants  of  parallel  and 
distributed  computing  systems.  Humans  should  only  point  the  way  to  new  algorithms 
for  new  problems.  Computers  should  handle  all  details  of  producing  fast  running  codes. 

The  short  eight  months  of  research  supported  by  this  grant  have  produced  many 
sound  scientific  results.  This  work  points  the  way  to  a  grand  unification  of  techniques  in 
program  transformation,  parallelization,  and  compilation  that  will  allow  the  creation  of 
libraries  of  reusable  portable  parallel  codes  that  will  run  efficiently  on  almost  any 
computer  system  to  be  found  in  or  on  the  computer  networks  of  the  world. 

Appendix: 


1)  Program  Transformations  for  Java  28  pp. 

2)  pjava  -  A  Parallel  Superset  of  Java  10  pp. 

3)  Creation  of  Efficient  and  Portable  Parallel  Programs  54  pp. 

4)  NORMA  -  Language  specification  -  Draft  copy  1.22  46  pp. 


5)  Organization  of  Loop  Computations  in  the  NORMA  Language  25  pp. 

6)  Organization  of  loop  process  on  nonprocedural  specification  19  pp. 
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Abstract 

A  program  transformation  system  for  Java  is  presented.  Two  disciplines  are  reviewed: 
supercompilation  and  partial  evaluation.  Supercoti^ilation  is  more  general  and 
powerful;  partial  evaluation  is  a  subset,  but  simpler  to  understand  and  to  use. 
Transformation  methods  were  originally  developed  for  functional  languages.  Here,  for 
the  first  time,  is  a  demonstration  of  supercompilation  for  the  imperative  language  Java. 
We  show  how  the  main  phases  of  a  supercon^iler  work  for  Java:  configuration, 
driving,  configuration  analysis.  Supercompilation  of  Java  is  explained  via  a  Producer- 
Consumer  sxamplQ,  including  a  trace  of  manual  supercompilaticm  steps. 

The  strong  points  of  this  r^ort  are: 

It  gives  an  excellent  review  of  program  transformation  methods. 

It  shows  that  the  syntax  of  the  popular  Java  programming  language  can  be  used  to 
write  computer-understandable  portable  parallel  programs  that  automatically  can  be 
deeply  analyzed,  optimized  and  tailored  to  run  efficiently  for  many  specific  problems  or 
computer  system  architectures;  and 

Its  major  sciaitific  contribution  is  a  demonstration  that  supercompilation  techniques 
are  already  powerful  enough  to  fuse  several  concurrent  threads  into  one,  dramatically 
increasing  the  efBciency  of  some  computer  programs. 

Supercompilers  can  perform  significant  interthread  analysis  and  source  code 
optimization.  We  e?q}lain  w^y  and  how  we  plan  to  develop  and  inclement  a  prototype 
supercompiler  for  Java. 

This  research  has  been  supported  by  the  Office  of  Naval  Research:  grant  00014-96-1-0800. 
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Overview  of  Program  Transformation  Methods  for  Java 


Introduction 

Why  supercompilation? 

Among  the  variety  of  program  transformation  techniques,  two  methods  perform  especially  deep 
optimizaticm: 

partial  evaluation  [Jones  et  al  93],  and 
supercompilation  [Turchin  86,  88,  93,  96], 

Originally,  these  were  developed  for  pure  functional  languages,  but  now  have  become  mature  aiough  to 
be  applied  to  wide-spread  practical  languages  like  Java.  Partial  evaluation  is  sin:5)ler  than 
supercompilation  and  is  better  studied.  The  former  is  actually  a  case  of  the  later.  A  year  ago,  in  our 
proposal  to  ONR,  we  planned  to  start  by  constructing  a  partial  evaluator  for  Java,  and  to  turn  to 
supercorrqjilation  only  vriien  we  saw  that  the  siirqjler  method  turned  out  well  and  we  understood  all 
subtleties  in  the  transformation  of  Java  programs.  However,  our  research  in  1996  has  shown  that  we 
were  too  timorous.  Hence,  this  r^ort  centers  around  supercompilation.  Our  main  contribution  is  that 
we  show  how  the  problem  of  fusion  of  several  threads  into  one  can  be  solved  by  supercompilation.  One 
of  the  reasons  why  we  have  laid  partial  evaluation  aside  is  that  it  cannot  fuse  threads,  although  we  do 
not  see  any  significant  problem  in  applying  it  to  Java. 

At  the  current  stage  of  research,  for  Java  we  plan  to  use  only  well-established  methods  of 
supercompilation,  ones  which  have  already  bear  proved  for  functional  languages.  Only  after  we  see 
how  these  work  in  prototype  implementation,  will  we  consider  using  advanced  techniques.  This  report 
presents  the  results  of  a  redevelooment  of  basic  supercompilation  notions  with  respect  to  Java.  We  are 
pleased  to  find  that  the  main  ideas  remain  the  same,  although  a  lot  of  details  are  new. 

Why  Java? 

Success  in  program  transformation  disciplines  like  supercompilation  greatly  depends  on  language 
properties.  It  is  often  said  that  functional  languages  are  better  than  imperative  for  such  a  task.  Indeed, 
partial  evaluation  applies  to  constant  subexpressions,  that  is,  program  parts  that  belong  to  a  functional 
subset.  However,  Ais  is  not  essential  for  supercompilation,  which  models  the  operational  behavior  of  a 
program.  What  actually  matters  for  all  kinds  of  program  transformation  is  the  data  model  of  a 
language  rather  than  control. 

The  majority  of  wide-spread  languages,  from  Fortran  to  C-H-,  are  based  on  the  data  model  of  a  von 
Neumann  computer.  TOs  means  that  data  definition  is  two-level.  First,  clear  abstract  data  domains  are 
defined:  integers,  strings,  records,  objects,  etc.  Second,  these  are  mapped  into  bit  sequaices  and  the 
one-dimensional  array  of  memory  cells.  A  lot  of  operations,  and  even  some  ofthe  data  types  (e.g. 
pointers),  refer  to  the  second  part  ofthe  definition:  data  coercion  and  allocate/deallocate  structures. 

This  two-level  data  model  badly  complicates  program  analysis  and  transformation,  and  is  a  real 
(probably,  the  only)  obstacle  for  bringing  the  achievements  of  computer  science  into  practice. 

Tne  reason  why  functional  and  logic  languages  are  better  suited  for  advanced  computing  techniques  is 
that  from  the  very  beginning  they  avoided  the  pressures  of  program  efficiency,  and  accepted  a  clean 
one-level  data  model.  The  semantics  of  programs  in  languages  with  a  one-level  data  model  can  be 
analyzed  and  manipulated  more  easily  by  other  computer  programs.  Human  programming  skills  are  not 
needed  for  all  aspects  ofthe  production  of  efficient  code. 
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Only  a  few  widely-known  languages  from  the  imperative  frimily  come  close  to  having  a  good  one-level 
data  model;  Algoi-68,  Ada,  and  Java.  And  that  is  all.  We  don’t  mention  such  languages  as  Oberon  and 
Modula-3,  since  these  were  not  widely  used).  Java  is  the  simplest  of  the  three,  but  the  most  interesting, 
since  it  actually  supports  two  programming  paradigms.  It  is  both  object-oriented  and  has  a  Junctional 
subset. 

By  saying  that  a  language  has  a  frmctional  subset  we  in:q}ly  that 

•  its  data  are  developed  well  enough  to  represent  complex  values  (at  least  trees)  by  immutable  data 
structures; 

•  functions  and  procedures  can  return  any  values. 

To  satisfy  these  requirements,  the  language  implementation  must  use  garbage  collection. 

Experts  in  functional  programming  usually  put  forward  a  third  requirement  that  higher-order 
programming  is  also  supported,  that  is  functicxis  may  be  used  as  first  class  data.  However,  this  is  not 
important  for  program  transformation.  On  the  other  hand,  Java  is  good  with  respect  to  this  feature  as 
well,  since  functions  as  values  are  easily  modeled  by  objects.  Java  satisfies  the  functional  programming 
requirements  best  of  all  wide-spread  imperative  languages.  Indeed,  in  [Wadler,  Odersky  97]  it  is 
demcmstrated  that  Java  notions  are  sufficient  to  ext^d  it  syntacticaUy  to  a  fully  functional  programming 
language. 

What  is  supercompilation? 

This  subsection  gives  an  overview  of  the  notions  and  algorithms  of  supercompilation.  These  are 
e;q)lained  in  more  detail  along  with  examples  in  the  next  secticm. 

In  short,  a  supercompiler  executes  a  program  in  general  terms,  with  some  data  replaced  by  variables, 
analyses  the  trace  and  constructs  an  equivaloit  residual'  program,  wdiich  is  usually  much  more  efficient 
that  the  original  one. 

In  more  detail,  the  definiticm  of  supercompilation  is  two-level.  The  lower  level  is  a  potentially  infinite 
process  called  driving.  The  upper-level  is  conjiguration  analysis,  a  process  which  supervises  the  trace 
of  driving,  performs  various  operations  on  the  trace,  controls  driving,  and  constructs  a  finite  graph  of  a 
residual  program,  thus,  compiles.  Hence,  the  term  super-compilation. 

The  notion  of  a  configuration 

The  central  notion  and  the  main  object  of  the  superconpiler  is  a  conjiguration.  A  conjiguration  is  the 
representation  of  (a  part  of)  a  generalized  state  of  the  machine  executing  the  source  program.  It 
resembles  a  state  representation  in  an  interpr^r,  but  contains  additional  features.  To  represent  sets, 
free  variables,  referred  to  as  conjig^iration  variables,  occur  in  configurations  instead  of  ground  values 
(final  values).  The  details  of  Java  configurations  are  defined  in  the  next  section. 

Driving 

The  basic  process  in  supercompilation  is  driving,  that  is  execution  of  a  source  program  in  terms  of 
configurations. 


'  In  accordance  with  the  partial  e\’aluation  tradition,  we  refer  to  a  result  program,  its  statements  and 
e.\pressions  as  residual.  The  act  of  generating  a  residual  statement  or  a  residual  expression  is  called 
residualriation. 

4 


CK’erview  of  Program  Transformation  Methods 


Driving  starts  with  wie  or  more  initial  configurations  supplied  by  the  user.  Each  initial  configuration 
generates  a  task  to  construct  a  residual  procedure.  It  contains  initial  configuration  variables,  which 
become  parameters  of  the  residual  procedure.  In  order  just  to  optimize  a  procedure,  the  user  gives  the 
initial  configuration  that  represents  the  call  to  the  procedure  with  all  arguments  taking  on  different 
configuration  variables.  To  specialize  a  procedure,  the  user  gives  known  values  to  some  arguments. 

Driving  gradually  cOTStructs  a  potentially  infinite  driving  tree  (also  referred  to  as  a  process  tree),  whidi 
rqjresents  the  set  of  all  traces  of  source  program  execution  fiar  the  set  of  its  initial  states  covered  by  an 
initial  configuration.  The  nodes  of  the  tree  correspond  to  configurations,  the  arcs  to  the  steps  of 
execution  of  the  source  program.  The  leaves  are  passive  configurations  that  ccmtain  no  procedure  calls 
and  usually  correspravd  to  the  return  statement  yielding  a  result  value. 

A  step  of  driving  is  either  transient,  or  produces  a  residual  statement  (a  residualization  step). 

Transient  driving  is  execution  of  steps  as  if  by  an  interpreter,  as  long  as  configuration  variables  do  not 
interfere.  Transiait  steps  result  in  a  linear  segment  ofthe  driving  tree.  No  statements  are  residualized 
during  transient  driving. 

Whai  an  unknown  represaited  by  a  configuration  variable  prohibits  executiwi  of  a  source  program 
statement,  or  a  procedure  called  in  the  statement  is  not  given,  the  statement  is  residualized.  The  values 
of  program  variables,  including  the  initial  cwifiguration  variables,  are  substituted  into  the  stetemoit, 
and  the  result  statement  is  residualized  and  stored  in  the  node  of  the  driving  tree  together  with  the 
configuration.  For  an  imconditional  residual  statement,  only  one  arc  proceeds  from  the  node. 

Otherwise,  two  or  more  arcs  leave  the  node.  The  arcs  enter  nodes  corresponding  to  possible  next 
configurations. 

The  next  configuration  may  be  narrowed  to  represent  the  informaticm  revealed  by  a  stq).  For  example, 
after  a  test  “if  X3=5  ; . .  where  X3  is  a  configuration  variable,  it  is  known  that  X3  equals  to  5  00  the 
positive  branch.  The  narrowing  information  is  r^resented  by  two  means.  First,  by  contraction,  that  is 
by  substituting  some  values  instead  of  configuration  variables  (X3 1->  5  in  our  example).  Secwid,  by 
adding  the  information  to  restrictions,  which  are  kept  together  with  configurations.  In  the  basic  case 
(which  we  plant  to  implemait  for  Java),  restrictions  are  inequalities  (for  example,  X3  5).  More  details 

are  given  in  the  next  sections. 

Configuration  analysis 

The  driving  tree  may  be  considered  as  an  infinite  representation  of  the  residual  program.  Indeed,  define 
its  interpretation  as  follows.  Forget  configurations,  keep  in  nodes  only  residual  statements  and  lists  of 
configuration  variables.  Regard  configuration  variables  as  program  variables.  Assign  values  to  the 
initial  configuration  variables  and  start  from  the  initial  node.  Execute  the  residual  statement  kept  m  the 
current  node.  If  only  one  arc  starts  from  the  node,  go  to  the  next  node.  If  several  arcs  proceed  from  the 
current  node,  thai  the  residual  statement  is  conditional,  and  its  execution  has  selected  one  of  the  arcs. 

Go  to  the  corresponding  node.  And  so  forth,  until  a  leaf  is  reached.  The  value  returned  by  the 
expression  in  the  leaf  is  the  result  of  interpretation. 

The  interpreter  would  work  similarly  if  given  a  finite  graph  rather  than  a  tree.  The  finite  graph  would 
be  an  eligible  result  of  compilation.  So,  the  only  (but  the  hardest!)  problem  is  to  fold  the  infinite  dnvmg 
tree  into  a  finite  graph.  This  is  the  task  for  configuration  analysis. 

The  following  operations  on  the  tree  can  be  performed  to  fi>ld  h: 

Looping  back:  If  a  configuration  C  is  a  subset  of  one  of  the  previous  ones,  C  ,  and  can  be  reduc«l  to  it 
by  a  substituti(Mi  |a,  C  =  |i  C  a  looping  back  arc  (corresponding  to  a  goto  statement  in  the  residual 
program)  is  added  to  the  graph  plus  appropriate  assignment  statements  reflecting  the  substitution  p. 
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Generalization:  A  subtree  starting  from  a  node  with  configuration  C  can  be  thrown  away  and  replaced 
by  a  subgraph  starting  with  a  more  general  configuration  C*,  such  that  C  =  p.  C*  fi)r  some 
substitution  p.  The  residual  assignments  corresponding  to  the  substitution  p  are  put  into  node  C. 

The  new  graph  rqjresents  an  equivalent  but  less  efficient  residual  program. 

Cutting  a  configuration:  A  subtree  starting  from  a  node  with  configuration  C  can  be  thrown  away  and 
replaced  by  two  subgraphs  starting  with  ccHifigurations  C\  and  C2,  which  are  parts  of  the  original 
CMifiguration  C,  and  C  is  a  kind  of  composition  of  Cj  and  C2.  We  will  not  go  into  details  yet. 
However,  the  residual  statement  correspcmding  to  C  is  a  call  to  a  procedure  Cj,  wdiile  C2  is  the 
continuation  of  C  after  Ci  returns. 

These  operations  are  sufficient  to  fold  any  infinite  driving  tree  into  a  finite  graph.  However,  to  detect 
particular  configurations  to  loop  back,  to  generalize  or  to  cut  is  the  hardest  problem  of 
supercompilation.  As  a  first  approximation,  we  will  use  the  method  described  in  [Turchin  88]  after 
modifying  it  for  multiple  threads.  The  idea  is  that  oily  ftie  structure  of  threads  is  used  to  make 
decisions.  The  values  bound  to  program  variables  are  not  taken  into  account.  This  method  guarantees 
termination  of  supercompilation.  The  details  are  beyond  the  scope  of  this  report. 

What  is  partial  evaluation? 

Partial  evaluaticm  is  a  simplification  of  supercompilation  in  several  respects. 

1.  In  supercompilation  one  residual  program  points  can  emerge  from  several  source  program  points. 
This  property  is  referred  to  as  polygenetic,  versus  monogenetic  [Romanenko  90].  Partial  evaluation 
processes  each  procedure  separately.  Hence,  one  residual  program  point  corresponds  to  only  one 
source  program  point.  This  is  monogenetic  program  transformation.  Several  residual  procedures 
can  still  be  produced  from  one  source  procedure.  In  partial  evaluation,  there  is  no  problem  in 
determining  how  to  cut  a  configuration.  We  may  say  that  cutting  happens  immediately  at  each 
procedure  call. 

2.  A  partial  evaluator  executes  subexpressions  depending  only  on  known  values.  Cases  when 
unknowns  are  present  in  data,  but  do  not  interfere,  are  not  considered  as  executable  by  partial 
evaluation  and  are  always  residualized.  Compared  to  supercompilaticm,  partial  evaluation  leaves 
many  more  Statements  for  executicm,  not  eliminated  at  compile  time. 

3.  The  decision  when  and  what  to  generalize  is  taken  in  advance,  before  known  arguments  are  supplied, 
by  a  preprocessor  that  performs  a  so-called  binding  time  analysis  (BTA).  The  problem  of 
generalization  is  drastically  simplified  as  well  for  partial  evaluation. 

Binding  time  analysis 

Partial  evaluation  is  used  to  specialize  procedures.  Given  a  procedure  of  several  argumaits,  say 
void  P(int  x,y,z),  one  wants  to  produce  a  procedure  Q  with  fewer  arguments,  say  z,  with  others  having 
been  bound  to  known  values,  say  x=l,  y=2.  The  implicit  definition  of  Q  is: 

void  Q(int  z)  { 

P(l,2,z); 

} 

void  P(int  x,y,z)  { 

} 

A  partial  evaluator  generates  specialized  versions  of  codes  by  executing  static  sube^qiressions,  ones 
that  depend  only  on  parameters  with  known  values.  All  other  expressions  are  dynamic.  Before  the 
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source  program  is  simplified,  it  is  evaluated  by  binding  time  analysis  (BTA)  to  separate  and  annotate 
static  subexpressions.  A  parameter  of  a  procedure  is  considered  static  if  it  takes  static  arguments  in  all 
procedure  calls.  BTA  iterates  its  abstract  interpretation  until  a  fixed  point  is  reached.  When  it  starts, 
all  parameters  of  all  procedures  are  hypothetically  considered  static,  except  the  dynamic  parameters  of 
the  main  procedure.  Information  about  being  dynamic  is  propagated  along  the  fimctional  dependencies 
of  the  procedures.  Whenever  an  argument  in  a  procedure  call  is  dianged  to  dynamic,  the  corresponding 
parameter  becomes  dynamic  as  well,  and  the  procedure  body  is  reanalyzed. 

In  the  P:Q  example,  if  procedure  P  calls  itself  recursively  in  such  a  way  that  the  second  argument  y 
d^ends  on  the  third  z,  and  the  first  argumait  x  does  not  depraid  on  y  or  on  z,  x  is  static  and  y  and  z  are 
dynamic. 

Partial  evaluation  proper 

The  second  phase  is  the  partial  evaluation  proper,  that  is  the  generation  of  specialized  procedures  when 
known  values  are  given.  Static  parameters  of  a  procedure  may  take  several  different  values  during  this 
process.  Respectively,  several  specialized  versions  of  the  same  procedure  are  generated. 

The  only  requirement  is  that  the  number  of  different  values  of  each  parameter  must  be  finite.  If  this 
does  not  hold,  the  process  does  not  finish,  the  user  should  stop  it  after  waiting  for  some  time,  and  study 
the  dump  to  find  an  argument  that  takes  infinite  number  of  values,  and  forcefully  declare  it  dynamic. 
Then  BTA  and  partial  evaluation  are  repeated. 

Partial  evaluation  as  a  case  of  supercompilation 

The  role  of  the  distinction  between  static  and  dynamic  parameters  can  be  explained  in  terms  of 
generalization  in  superconqrilation  as  follows: 

•  static  arguments  are  never  generaiizea; 

•  dynamic  arguments  are  always  generalized  to  a  configuration  variable. 

The  user  can  turn  static  parameters  or  subejqpressions  to  dynamic  by  hand,  thus  controlling 
gmiBraliratinn  Sin^licity  is  the  main  benefit  and  the  main  limitaticHi  of  partial  evaluation. 

Benefits  and  limitations  of  partial  evaluation 

If  something  can  be  done  by  a  simpler  method,  it  should  be  done  by  it.  A  great  discovery  by  the  authors 
of  partial  evaluation  was  that  it  is  sufficient  to  convert  interpreters  to  compilers,  solving  a  large  class  of 
practical  problems.  The  main  benefit  of  partial  evaluation  is  that  it  allows  clear  control  by  a  user  who 
is  developing  an  interpreter  and  allows  experiments  with  specializing  it: 

•  BTA  results  are  simple  for  a  user  to  read  and  to  understand.  They  are  just  comments  annotating  the 
source  program.  A  user  may  increase  the  level  of  generalization  (residualization)  by  marking  known 
(static)  variables  and  subexpressions  as  unknown  (dynamic). 

•  Termination  properties  are  clear  to  a  user:  whether  evaluation  of  static  subexpressions  results  in  a 
finite  or  infinite  set  of  values  of  procedure  arguments. 

»  Termination  is  easily  controlled  by  a  user:  if  a  procedure  argument  takes  on  an  infinite  set  of  values, 
the  user  should  forcefully  mark  it  as  dynamic. 

The  main  limitations  of  partial  evaluation  are: 

•  It  does  not  try  to  generalize  complex  configurations  than  BTA  did  not  pick  in  advance. 
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•  It  processes  procedures  and  threads  separately  and  cannot  fuse  them  (monogenetic). 

•  It  evaluates  only  subexpressions  belonging  to  a  functional  subset. 

Composite  system  of  partial  evaluation  and  supercompilation 

We  have  two  powerful  program  transformation  methods  which  can  be  implemaited  for  Java: 

•  One  is  more  general,  powerful  and  complex:  supercompilation; 

•  The  other  is  less  powerful,  but  simpler  to  understand  and  to  use:  partial  evaluation. 

Supercompilation  is  not  a  just  a  single  method.  It  is  a  series  of  methods,  each  based  on  previous  ones. 
However,  in  the  nearest  future,  we  plan  to  implement  a  prototype  of  just  the  basic  supercompiler, 
discussed  in  this  r^ort.  A  practical  program  transformation  system  should  contain  program 
transformers  of  different  power  and  with  different  features.  It  would  be  advantageous  to  have  both 
partial  evaluation  and  supercomp ilati<xi  for  Java. 

In  such  a  dual  system,  the  two  program  transformers  should  be  used  in  the  following  order: 

1 .  A  program  should  be  optimized  by  the  sin^ler  one,  partial  evaluation; 

2.  The  supercompiler  should  be  applied  to  the  result  of  partial  evaluation. 

The  supercompiler  can  perform  the  work  of  partial  evaluator  as  well,  but  the  later  can  do  this  more 
efficiently,  especially  when  the  subject  program  is  still  under  developmait  and  its  author  needs  to 
perform  a  series  of  experiments  to  understand  how  in  behaves  under  program  transformation.  Having 
two  program  transformers  for  one  practical  language,  we’ll  have  a  unique  opportumty  to  compare  these 
and  get  invaluable  experience  in  using  differrait  techniques  for  constructing  libraries  of  reusable 
software. 

However,  we  plan  first  to  concentrate  on  basic  supercompiiation.  if  we  nave  support  enough,  a  java 
partial  evaluator  will  be  developed  by  another  researcher. 
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Configuration  Representation 


A  configuration  is  a  generalized  state  of  a  machine  execxiting  a  source  program.  It  resembles  a 
program  state  representation  in  a  Java  interpreter,  but  contains  configuration  variables. 


We  graphicaUy  denote  crmfiguratiois  as  follows. 


Figure  1 


Threads: 


'‘thread  name” 

©  or  © 

procedure  name 

program  point 

variable  name 

variable  value 

procedure  name 

program  point 

variable  name 

variable  value 

Objects: 


class  name 

D  or  £7 

variable  name 

variable  value 

(sub)class  name 

variable  name 

variable  value 

1 

(sub)class  name 

variable  name 

variable  value 

configuration  variable  (optional) 

class  name 

C2  or 

admission  queue 

list  of  ref  to  threads 

wait  queue 

list  of  ref  to  threads 

variable  name 

variable  value 

(sub)class  name 

variable  name 

variable  value 

Object  with  synchronized  methods 


Restrictions:  set  of  inequalities 


More  formally,  and  in  more  detail,  the  rqrresentation  of  a  configuration  is  defined  as  follows: 


configuration 
thread 
stack  frame 
object 

variable  binding 

variable  value 

ground  value 

simple  type 
value 


=  (set  of  threads,  set  of  objects,  set  of  restrictions) 

=  (thread  name,  active  ©  or  passive  ©,  list  of  stack  fi^mes) 

=  (procedure  name,  program  point,  list  of  variable  bindings) 

=  (list  of  (class  name,  list  of  variable  bindings)  optionally  ended  by  a  configuration  variable, 
safe  Cli  or  unsafe  £/) 

=  (variable  name,  variable  value) 

=  ground  value  or  configuration  variable 

=  simple  type  value  or  reference  to  an  object 

=  integer  or  floating  point  number  or  character  or  Boolean 
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Configuration 

A  configuration  is  a  triple  of  a  set^  of  threads,  a  set  of  objects  and  a  set  of  restrictirais. 

Threads  and  objects  are  identified  by  references.  References  to  threads  can  be  assigned  to  variables  of 
the  built-in  class  Thread  and  to  an  admission  queue  and  a  wait  queue  that  are  special  variables 
automatically  added  to  an  object  with  synchronized  methods.  References  to  objects  are  used  as  variable 
values.  We  denote  a  reference  by  a  dot  with  an  arrow  going  to  the  respective  object  or  thread.  We  have 
no  reason  to  treat  arrays  separately  as  it  is  done  in  the  Java  specification,  and  interpret  these  like 
instances  of  a  built-in  class,  say  Array,  with  the  obvious  definition. 

Threads 

A  thread  consists  of  a  thread  name,  a  Boolean  tag,  ©  or  ©  marking  wdiether  the  tread  is  active  or 
passive,  and  a  stack  of  procedure  calls  (a  list  of  stack  frames).  The  thread  name  is  a  string  passed  as 
an  argument  to  the  Thread  constructor.  We  use  it  as  a  comment  to  clarify  the  example  (e.g. 
“mainThread”,  “Producer”,  “Consumer”).  A  thread  is  passive  if  the  reference  to  it  occurs  in  an 
admission  or  wait  queue.  Otherwise  it  is  active.  Java  semantics  allow  a  thread  to  wait  only  in  one 
queue.  An  admission  queue  lists  the  threads  waiting  to  enter  a  synchronized  method.  The  threads  that 
have  executed  waitO  and  have  not  received  notifyOstand  in  a  wait  queue. 

A  stack  frame  consists  of  a  procedure  name,  a  program  point  and  a  list  of  bindings  that  are  pairs  of  a 
variable  name  and  a  variable  value.  In  our  examples,  the  program  point  is  a  number  or  a  letter  E  that 
can  be  found  in  the  program  text  to  the  right  of  the  statement;  it  denotes  a  point  that  has  been  just 
executed.  If  a  procedure  is  not  static,  then  the  first  binding  is  for  the  variable  this.  That  is,  we  consider 
a  method  call  of  form  obiect  .methodfargs)as  a  procedure  call  of  form  class.method(object.args),  where 
the  method  in  denned  in  the  class. 

Objects 

The  main  contoits  of  an  object  are  variable  names  and  values.  A  real  supercompiler  may  avoid 
spanding  memory  for  variable  names  by  renaming  variables  to  ordinal  munbers.  These  are  organized  in 
a  list  of  subclasses:  first,  the  top  class  name  and  its  variables,  second,  the  class  name  of  a  first  inheritor 
and  the  variables  defined  in  h,  then  the  second  inheritor,  and  so  on.  The  hst  of  subclasses  may  be  ended 
by  a  configuration  variable  (see  below). 

An  object  has  an  attribute,  safe  or  unsafe,  denoted  in  figures  as  Cj  and  Its  use  is  as  follows.  A 
configuration  represents  part  of  a  run-time  residual  program  state.  Other  runtime  threads  may  refer  to 
the  objects  of  a  current  configuration  and  may  change  them  as  well.  Hence,  a  supercompiler  must  be 
able  to  imply  that  variable  values  may  change  at  any  moment,  that  their  object  is  unsafe.  However,  if  it 
knows  that  an  object  is  referenced  only  from  the  current  configuration,  it  know  that  its  values  change 
only  explicitly  during  driving.  Such  an  object  is  marked  in  a  configuration  as  safe.  Tne  safe  attribute  is 
present  only  at  (super)compile  time  and  takes  no  space  at  run-time.  In  our  examples  ail  objects  are  safe. 
See  the  definition  of  driving  for  details  of  keeping  the  unsa  fe  attribute. 


*  A  set  is  an  unordered  finite  list.  Of  course,  in  a  computer,  sets  are  represented  by  lists  with  some  fixed  order. 
By  saying  ‘sef  ’  we  imply  that  its  order  does  not  matter  as  well  as  that  we  must  check  different  orders  of 
corresponding  lists  while  comparing  configurations  during  supercompilation. 
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Configuration  variables 

The  supercompiler  computes  with  partially  unknown  data.  The  substitutes  for  unknowns  are  referred  to 
as  conflgurcition  variobles.  A  configuration  variable  may  occur  instead  of  any  variable  value  in  threads 
and  objects.  We  denote  configuration  variables  by  italic  identifiers  with  subscripts:  mi,  etc.  For 
clarity  in  exan^les,  we  use  the  same  identifier  for  a  amfiguration  variable  as  the  name  of  the  program 
variable,  which  supplies  the  initial  value  for  that  configuration  variable.  A  configuration  variable  may 
be  copied  and  assigned  to  any  thread  or  object  variable,  and  may  occur  in  several  places.  This 
represents  information  about  the  equality  of  values  in  different  places.  If  a  configuration  variable 
remains  instead  of  a  ground  value  \^diena  value  must  be  known  to  perform  an  operation  (e.g.  in  the  if 
COTStruct),  an  appropriate  statemait  is  put  into  the  residual  program.  The  new  configuration  may  be 
either  the  same  as  before,  or  changed.  In  particular,  a  configuration  variable  may  be  contracted,  that  is 
replaced  by  an  expression  which  represents  information  that  has  become  known.  See  the  definition  of 
driving  for  details. 

One  more  case  of  an  xmknown  in  a  Java  configuration  is  the  tail  of  a  list  of  subclasses  in  the  object 
representation.  When  an  object  has  been  passed  to  a  procedure  as  a  parameter,  say  x,  of  a  not  final 
class  C,  it  may  actually  belong  either  to  C,  or  to  some  subclass  C  of  C  This  is  not  known  until  x  is 
tested  by  x.isC  ’0  (or  some  other  Java  method).  Before  the  test,  the  uncertainty  is  represented  by  a  tail 
configuration  variable.  After  the  test,  the  configuration  variable  is  contracted  and  (on  the  positive 
branch)  replaced  by  a  structure  corresponding  to  the  subclass  C  and  a  new  tail  configuration  variable 
if  C  ’  is  not  final  as  well. 

Restrictions 

Information  about  configuration  variables  is  propagated  by  the  supercon:q)iler  by  two  means.  First,  by 
substitution  to  reflect  an  equality:  a  configuration  variable  may  be  replaced  by  a  ground  value  or 
another  configuration  variable.  This  operation  is  referred  to  as  contraction  ot  the  vanabie.  inis 
happens  when  it  becomes  known  that  a  variable  equals  to  a  constant  or  another  configuration  variable. 

In  the  special  case  of  a  tail  variable,  the  replacement  is  a  structure  as  just  described. 

The  opposite  case  of  inequality  is  inqiossible  (or  rather,  inconvenient)  to  represent  by  a  substitution. 
Inequalities  are  collected  as  restrictions  to  the  set  of  states  represented  by  a  configuration. 

In  principle,  the  supercompiler  can  work  and  produce  meaningful  results  without  keeping  restrictions. 
The  result  would  be  less  accurate,  the  residual  program  would  be  larger  and  less  efficient,  but  correct. 

It  is  the  essence  of  supercompilation  that  the  ideal  result  cannot  in  general  be  achieved,  and  from  time  to 
time  a  supercompiler  must  forget  some  restrictions,  or  perform  some  other  kind  of  generalization,  in 
order  to  construct  a  finite  residual  program. 

The  class  of  restrictions  to  keep  with  configurations  and  the  algorithm  to  compare  configurations  while 
accounting  for  restrictions  are  a  matter  of  heuristic  choice  by  a  supercompiler  designer.  Keeping  in 
mind  a  particular  class  of  tasks,  we  plan  to  implement  the  following  restrictions  in  the  Java 
supercompiler: 


variable 

< 

constant 

variable 

< 

variable 

variable 

< 

constant 

variable 

< 

variable 

variable 

constant 

variable 

r— 

variable 

variable 

class 

name 
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Producer-Consumer  Example 
Introduction 

Threads  are  used  in  imperative  languages  like  Java  not  only  for  load-balancing  multiprocessor  systems, 
but  for  well-structured  programs  in  object  oriented  paradigms.  However,  a  higjily  structured  program  is 
often  less  efficient  on  a  monoprocessor  or  cm  a  system  where  the  number  of  processors  is  significantly 
less  than  the  number  of  threads  in  the  program.  The  following  example  demcmstrates  that 
supercompilation  can  fose  threads  anci,  in  particular,  transform  a  well-structured  multithreaded  program 
into  a  single-threaded  efficient  one. 

Example 

Consider  a  traditional  example  of  a  consumer  and  a  producer  communicating  via  a  queue  objert.  The 
example  is  based  cm  one  fi-om  the  “Java  Handbook”  by  Patrick  Naughton,  Osborn  McGraw-Hill, 
pp.  191-194.  The  program  below  consists  of  a  class  Queue  that  implements  a  queue  of  integers,  a  class 
Producer,  a  class  Consumer^  and  a  procedure  doit  that  constructs  Queue,  Producer  and  Consumer 
instances.  Two  new  threads  start  in  Producer  and  Consumer. 

Class  Queue 

The  class  Queue  has  2  operations: 

void  ptit(int  n); 
int  getO; 

The  first  operation  puts  a  integer  into  a  queue.  The  second  operation  gets  the  integer  from  the  queue. 
The  queue  can  hold  at  most  one  element. 

class  Queue  { 
intn; 

boolean  valueSet  =  folse; 
synchronized  void  put(int  n)  {  0 

if  (valueSet)  1 

try  waitO;  catch(InterruptedException  e);  2 

this.n  =  n;  3 

valueSet = true;  4 

notifyO;  E 

} 

synchronized  int  getQ  {  0 

if  (IvalueSet)  1 

try  waitO;  catch(InterruptedException  e);  2 

valueSet  =  false;  3 

notifyO;  ^ 

return  n;  E 

} 

}  ' 


^  The  class  Queue  is  a  copy  of  the  class  Q  on  p.  194  in  all  but  its  name.  The  classes  Producer  and  Consumer 
differ  slightly  from  that  on  p.  192  in  the  procedure  run. 
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This  is  a  general  purpose  class  which  may  be  thought  of  as  belonging  to  a  library.  The  following  two 
classes  are  simple  examples  of  a  producer  and  a  consumer,  which  may  be  thought  of  as  being  written  by 
an  end  user. 


Class  Producer 


A  Producer  reads  k  integers  by  calling  a  function  My.readint,  and  puts  them  into  the  Queue  supplied  by 
the  parameter  q.  It  starts  a  separate  thread. 

class  Producer  iiiq)lemaits  Ruimable  { 

Queue  q; 
int  k; 

Producer(Queue  q,  int  k)  {  0 

this.q  =  q;  1 

this.k  =  k;  2 

new  Thread(this,  "Producer").startO;  E 

} 

pubUc  void  runO  throws  lOException  {  0 

for  (int  i=l;  I 

i<=k;i-H-)  {2 
int  n  =  My.readIntO;  3 
q.put(n);  4 

}  E 


} 


} 


Class  Consumer 

A  Consumer  takes  k  integers  from  the  Queue  supplied  by  the  parameter  q,  applies  a  function  My.F  to 
them  and  outputs  the  result  by  calling  a  function  My.writeint  (the  class  My  and  the  fimctions 
My.readint,  My.F,  and  My.writeint  are  not  specified  at  supercompile  time).  It  starts  a  separate  thread. 

class  Consumer  implemmts  Runnable  { 

Queue  q; 
int  k; 

Consumer(Queue  q,  int  k)  {  0 

this.q  =  q;  1 

this.k  =  k;  2 

new  Thread(this,  "Consumer"). start();  E 


public  void  runQ  { 
for  (int  i=l; 


i<=k;  1+-^)  !  2 
int  n  =  q.getO; 
int  m  =  My.F(n): 
Mv.writelnt(m); 


0 

1 


4 
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The  class  ProduceConsume 

The  task  for  the  supercompiler  is  to  qptimize  a  procedure  doit  in  a  class  ProduceConsume,  which 
demonstrates  a  particular  use  of  the  Queue,  Producer  and  Consumer  classes.  It  supplies  the  number  of 
integers  to  produce  and  consume,  k,  to  Producer  and  Consumer  instances  .  Below,  3  versions  of  the 
doit  procedure  and  respective  expected  results  of  supercompilation  (computed  manually  keeping  in  mind 
a  particular  supercompilation  strategy)  are  presented. 

The  1“  and  versions  are  substantionally  the  same  case:  these  differ  only  in  the  order  of  the  Producer 
and  Consumer  are  constructed  and  the  respective  threads  are  started.  Implying  that  the  supercompiler 
uses  a  particular  order  of  evaluation  {driving)  of  threads  (we  use  “elder  threads  are  evaluated  earlier”), 
the  residual  programs  differ  in  the  order  of  calls  to  My.readint  and  My.writeint.  The  essence  of  these 
versions  is  that  the  loop  parameter  is  static,  k=4.  Hence,  the  residual  program  has  sin:q)le  structure:  it  is 
linear.  The  main  problem  supercompiler  must  solve  here  is  not  to  stop  too  early.  Otherwise,  the  threads 
would  not  be  fully  fused  into  one  as  in  the  expected  results  of  supercompilation  shown  below.  The 
criteria  to  continue  driving  that  has  been  used  during  manual  supercompilation  is  natural:  the  loq)S  are 
controlled  by  inequalities  of  form  i<=k,  which  involve  no  configuration  variables,  and  the  difference 
between  the  compared  integers  decreases. 

The  3'“*  version  differs  essentially:  the  loop  parameter  k  is  dynamic.  Hence,  the  residual  program 
contains  a  loop  on  k  (the  criteria  above  does  not  work,  since  the  loop  conditions  have  configuration 
variables).  The  main  problem  supercompiler  must  solve  here  is  to  catch  a  moment  when  to  construct  a 
basic  configuration,  that  is  a  point  in  the  residual  program  to  loop  back.  The  algorithm  to  estimate  2 
configurations  as  being  similar  and  to  generalize  ±em  is  based  on  the  paper  [Turchin  88].  The  idea  is 
that  to  estimate  similarity  only  the  form  of  thread  stacks  is  compared,  while  the  values  of  program 
variables  are  ignored.  However,  all  values  are  used  to  construct  the  least  general  configuration. 

Manual  supercompilation  of  the  3"*  version  is  shown  below. 

version:  Static  loop  parameter;  Producer  starts  before  Consumer 
Class  ProduceConsume 


class  ProduceConsume  { 

public  static  void  doitQ  {  0 

int  k  =  4;  1 

Queue  q  =  new  QueueQ;  2 

new  Producer(q,  k);  3 

new  Consumer(q,  k);  E 

} 

} 
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Expected  result  of  supercompilation 

class  ProduceConsume  { 

public  static  void  doitO  { 
int  nl  =  My.readIntO; 
int  n2  =  My.readIntO; 
int  n3  =  My.readIntO; 
int  ml  =  My.F(nI); 
My.writelnt(ml); 
int  n4  =  My.readIntO; 
int  m2  =  My.F(n2); 
My.writelnt(m2); 
int  m3  =  My.F(n3); 
My.writelnt(m3); 
int  m4  =  My.F(n3); 
My.writelnt(m4); 

} 

} 


version:  Static  loop  parameter;  Consumer  starts  before  Producer 

Ciass  ProduceConsume 


class  ProduceConsume  { 

public  static  void  doitO  {  0 

int  k  =  4;  1 

Queue  q  =  new  QueueO;  2 

new  Consumer(q,  k);  3 

new  Producer(q,  k);  E 
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Expected  result  of  supercompilation 

class  ProduceConsume  { 

public  static  void  doitO  { 
int  nl  =  My.readIntO; 
int  ml  =  My.F(nl); 
My.writelnt(ml); 
int  n2  =  My.readIntO; 
int  m2  =  My.F(n2); 
My.writelnt(m2); 
int  n3  =  My.readIntO; 
int  m3  =  My.F(n3); 
My.writelnt(m3); 
int  n4  =  My.readIntO; 
int  m4  =  My.F(n4); 
My.writelnt(m4); 

}  } 


y®'  version:  Dynamic  ioop  parameter;  Consumer  starts  before  Producer 
Class  ProduceConsume 

class  ProduceConsume  { 

public  static  void  doit(int  k)  {  0 

Queue  q  =  new  QueueO;  1 

new  Consumer(q,  k);  2 

new  Producer(q,  k);  E 

}  } 

Expected  result  of  supercompilation 

class  ProduceConsume  { 

public  static  void  doit(int  k)  { 
for  (int  i=l;  i<=k;  i-H-)  { 
int  nl  =  My.readIntO; 
int  ml  =  My.F(nl); 

My .  writelnt(m  1 ); 

}  }  } 

Two  interacting  threads  have  been  replaced  by  a  single  thread.  Supercompilation  has  found  one 
efficient  loop  that  is  equivalent  to  the  entire  interaction! 
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Producer-Consumer  Example:  Supercompilation  Trace 

In  this  secticm  the  main  steps  of  the  supercompilation  process  for  the  Producer-Ccmsumer  example  are 
presented.  See  previous  section  for  the  text  of  the  Java  program.  ^Ve  consider  the  3  version  of  the 
example:  dynamic  loop  parameter;  Consumer  starts  before  Producer.  Here  is  the  reminder  of  the 
procedure  to  be  supercompiled: 

C/ass  ProduceConsume 

class  ProduceConsume  { 

public  static  void  doit(int  k)  {  0 

Queue  q  =  new  QueueO;  1 

new  Consumer(q,  k);  2 

new  Producer(q,  k);  E 

} 

} 

Initial  configuration 

Supercompiler  starts  with  one  (or  more)  initial  configuration(s).  If  we  are  interested  in  optimizing  some 
procedure,  the  initial  configuration  represents  the  call  to  it  with  all  parameters  bound  to  distinct 
configuration  variables.  In  our  case,  this  is  the  call  ProduceConsume.doit(*i),  where  ki  is  a 
configuration  variable  that  stands  for  an  unknown  value  of  the  parameter  k. 

_ Figure  2 


Threads: 


'"mainihrcad" 

ProduceC  onsume.  doit 

0 

k 

IS 

Objects:  none 

Restrictions:  none 


Configuration  at  the  end  of  the  Producer  constructor 

While  configuration  variables  do  not  interfere,  the  supercompiler  perfijrms  transient  driving  steps  that 
execute  a  program  like  a  common  interpreter.  For  Java,  transient  driving  imitates  execution  of  a 
generally  multithreaded  program  on  a  single  processor.  Some  particular  order  to  evaluate  threads  must 
be  chosen.  Any  order  is  acceptable,  but  the  residual  program  may  differ  depending  on  the  order.  This 
e?cample  uses  die  order  “eldest  thread  executes  first".  When  the  next  step  of  die  eldest  thread  cannot  be 
executed,  either  because  it  has  come  to  a  passive  state,  or  because  a  configuration  variable  needs  a 
value  used  to  select  one  branch  of  a  conditional  statement,  the  next  younger  thread  executes,  and  so  on. 
After  transient  driving  of  all  threads  has  completed,  a  branching  in  the  residual  program  is  generated, 
and  the  process  goes  further  along  one  branch,  then  another,  as  needed.  Branching  details  are  discussed 
below.  First,  we  will  see  how  the  supercompiler  performs  its  initial  transient  driving  steps,  vi^ich  are 
equivalent  to  the  steps  of  an  interpreter. 
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Producer-Consumer  Example:  Supercompilation  Trace 


Figure  3  shows  the  configuration  when  mainThread  has  come  to  the  end  of  the  procedure 
Producer.Producer  at  program  point  E,  as  mariced  to  the  ri^t  of  the  program  text.  The  mainThread 
dies  after  procedures  Producer.Producer  and  ConsumeProduce.doit  return.  The  threads  Consumer  and 

Producer  have  just  been  created  and  are  standing  at  the  beginning  of  their  respective  runQ  procedures. 

_ Figure  3 


Threads: 


Restrictions:  none 


After  mainThread  dies,  the  newly  eldest  thread  Consumer  starts  executing  and  enters  its  loop  .  Its 
integer  variable  i  is  initialized  to  1  and  the  loop  condition  i<=k  must  be  checked.  After  substituting  the 
values  i=l  and  k=yti,  the  supercon^iler  tries  to  evaluate  l<=ii  and  notices  that  configuration  variable  ki 
does  not  allow  a  branch  to  be  choosen,  so  transient  driving  stops  for  thread  Craisumer.  Thread 
Producer  starts  executing  and  stops  in  a  sumlar  state  when  it  tries  to  check  i<— k,  as  1<— at  the 
beginning  of  its  loop.  The  two  threads  are  evaluating  different  procedures.  Consumer. run  and 
Producer  .run,  and  are  checking  different  i  and  k  variables,  which  just  coincidentally  are  called  by  the 
same  local  names.  Their  full  names  are  Consumer.run.i  and  Producer.run.i,  Consumer.run.this.k  and 
Producer.run.this.k.  Coincidentally,  the  is  and  ks  have  the  same  values.  1  and  respectively.  The 
supercompiler  will  notice  these  coincidences  and  generate  efficient  code. 
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Basic  configuration:  Consumer  and  Producer  to  check  ioop  condition  i  <=  k 

_  Figure  4 


Threads: 


Restrictions:  none 


Residual  program  constructed  till  now: 

public  static  void  doit(int  k)  { 

//  Initial  configuration. 

il  =  I; 

//  Basic  configuration. 

Ml: 

//  ...  (Dots  denote  the  residual  program  is  not  vet  constructed  here). 

} 


The  configuration  in  Figure  4  corresponds  to  the  start  of  the  loop  in  the  residual  program. 

Configurations  that  have  two  or  more  predecessors  in  the  residual  program  graph  are  called  basic. 

They  mark  the  start  of  new  basic  blocks,  sequential  code  sequences  amenable  to  simple  transient  driving 
and  points  where  variables  used  in  two  or  more  control  threads  must  be  made  compatible.  Constructing 
basic  configurations  is  the  main  task  of  a  supercompiler.  It  is  the  most  complex  problem  in 
supercompilation.  Basic  configurations  emerge  as  a  result  of  loops. 

In  this  example,  when  the  supercompiler  first  comes  to  this  point,  there  is  no  configuration  variable  /i. 
The  number  1  is  the  value  for  each  local  i  variable.  The  supercompiler  continues  (details  of  how  this 
happens  are  shown  below),  executes  the  first  iteration  of  the  loops  in  Consumer  and  Producer,  and 
comes  to  a  similar  configuration,  but  with  constant  2  for  each  /.  The  supercompiler  must  stop  iterating 
and  generalize  two  similar  configurations.  The  heart  of  tlie  problem  for  generatmg  residual  program 
loops  is  recognizing  similar  configurations. 

Deciding  whetiier  to  stop  driving  after  having  met  similar  configurations  is  tlie  most  complex  problem  of 
supercompilation.  The  basic  method  shown  in  this  example  and  planned  for  our  Java  implementation 
comes  from  the  paper  [Turchin  88].  This  method  of  comparing  and  generalizing  configurations  was 
developed  for  lazy  evaluation  rather  than  for  multiple  threads  run  in  applicative  order.  However,  with 
small  changes  it  can  also  be  used  for  Java  threads. 
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Producer-Consumer  Example:  Supcrc('»mpilation  Trace 


We  do  not  discuss  the  general  case  here,  just  an  idea  of  how  it  works  for  this  example.  Configurations 
with  stack  frames  that  have  the  same  structure  but  different  variable  values  are  considered  similar. 

Whai  the  current  configuration  is  similar  to  a  previous  one,  the  supercompiler  stops,  and  constructs  the 
least  common  generalization  of  the  configuratitMis.  It  throws  away  the  parts  of  the  residual  program 
graph  that  start  with  the  old  configuration,  appends  the  generalized  configuration  the  remaining  program 
graph,  and  puts  an  assignment  of  new  configuraticm  variables  just  before  the  new  generalized 
configuraticwi. 

Figure  4  shows  Ae  least  common  generalization  of  two  similar  configurations  wdiich  differed  in  two 
places  where  Ae  first  had  constant  1  instead  of  variable  /'i,  Ae  current  had  constant  2.  The  assignment 
il  =  1  is  added  to  Ae  residual  program  in  generalizing  Ae  old  configuration. 


Configuration  when  Consumer  waiting  and  Producer  at  start  of  loop  body 

_ Figures 


Threads: 


Residual  program: 

public  static  void  doit(int  k)  { 

//  Initial  configuration, 
il  =  1: 

,7  Basic  coniiguralion. 

Ml:  if  (!(il<=kl))  goto  M2; 

7  Current  configuration:  like  the  basic  one  but  with  restriction  {/.  <  A'l }. 

:/  Conilguration  like  the  basic  one  but  with  restriaion  {/■,  >  k,, } . 

'  M2: 

//  ... 

> 
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Between  the  basic  configuration  in  Figure  4  and  the  configuration  in  Figure  5,  the  supercompiler 
reasons  as  follows.  The  next  step  after  the  basic  configuration  cannot  be  executed  unambiguously 
because  of  configuration  variables  in  the  condition  /i  ^  k\.  In  such  a  (general)  case,  driving  is 
performed  as  follows.  The  if  statement  with  an  unevaluated  condition  is  put  into  the  residual  program, 
and  the  process  of  supercompilation  continues  along  one  of  the  branches.  When,  sooner  or  later,  it  has 
finished,  the  supercompiler  will  continue  driving  of  the  second  branch.  In  our  example,  the  if  statement 
in  the  residual  program  looks  as  follows:* 

if  (!(il<=kl))  goto  M2; 

//  Positive  branch  for  condition  ii<ki 
M2:  //  Negative  branch 

At  best,  configurations  on  the  branches  should  represent  the  exact  sets  of  states  into  wdiich  a 
conditional  statemait  has  split  a  configuration.  At  worst,  driving  of  each  branch  may  continue  fi’om  the 
same  configuration.  This  is  only  a  question  of  the  quality  of  the  residual  program.  There  is  a  general 
rule  of  supercompilation:  “At  any  moment,  a  configuration  may  be  replaced  by  a  more  general  cme, 
reducing  the  quality  of  the  residual  program,  but  preserving  correctness.” 

The  planned  implementation  of  a  supercompiler  for  Java  will  be  able  to  represent  the  exact  splitting  of 
the  configuration  in  this  example.  The  restriction  {/i  /rj}  is  added  to  the  configuration  on  the  positive 
branch,  and  {/i  >kx}  on  the  negative. 

The  supercompiler  continues  driving  the  positive  branch  and  executes  the  Consumer  thread  until  it  has 
called  waitO  in  the  procedure  Queue.run  and  has  been  put  into  the  wait  queue.  This  moment  is  shown 
in  Figure  5. 


*  Although  Java  has  no  goto  statement,  we  use  it,  since  it  is  a  natural  means  to  represent  an  arbitral  graph  of  a 
residual  program  as  a  plain  te.\t.  We'll  use  a  postprocessor  to  translate  gotos  to  legitimate  Ja\a  constructs. 

(See  discussion  below). 
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Configuration  when  Producer  notifies 


Figure  6 


Residual  program: 

public  static  ^'oid  doittint  k)  { 

//  Initial  configuration. 
il=  1; 

//  Basic  configuration. 

Ml:  if(!(il<=kl))gotoM2; 

//  Configuration  like  the  basic  one  but  with  restriction  {/j  <  kx }. 

int  nl  =  My.readIntO; 

//  Current  configuration. 


// 

//  Configuration  like  the  basic  one  but  with  restriction  {I’l  >  kx }. 

M2: 

//  ... 


While  the  Consumer  thread  is  passive,  the  Producer  thread  executes.  The  supercompiler  behaves  like 
an  interpreter  until  it  meets  the  statement  int  n  =  My.readIntO.  Tlie  procedure  My.readint  is  ^known 
at  supercompile  time,  and  we  have  nothing  to  do  but  to  represent  tiie  '.-aiue  of  n  by  a  new  conngiiration 
variable  >i\,  and  to  generate  the  statement  int  n  1  =  My.readIntO. 

Then  the-Producer  goes  ftirtlier  and  steps  when  it  calls  notif-O  and  releases  the  Consumer.  The  current 
configuration  is  shown  in  Figure  6. 


Configuration  when  Consumer  to  check  loop  condition  for  the  second  time 

_ Figure  7 


Threads: 


Objects: 


“Consumer” 

© 

Consumer.run 

1 

this 

J _ _ _ / 

^ii+1 

Queue 

Cj 

Consumer 

a 

admission  queue 

[1 

_a _ 

wait  queue 

n 

k 

Vk,  1 

n 

n\ 

— - - 

- 

valueSct 

false 

i-* - 

Producer 

Q 

q 

J- 

Restrictions:  {/ 1  <  } 


Residual  program: 

public  static  void  doit(int  k)  { 

//  Initial  configuration, 
il  =  1; 

//  Basic  configuration. 

Ml:  if{!(il<=kl))goloM2; 

int  nl  =  My.readlntO; 
int  ml  =  My.F(nl); 

My.writelnt(ml); 

//  Current  configuration. 

II  ... 

//  Configuration  like  the  basic  one  but  with  rcstnction  i  >  A , ,  ■ 
M2: 

// 

>  _ 


After  the  Consumer  thread  has  been  released  by  notifyO,  two  statements  calling  the  procedures  My.F 
and  My.writeint  are  added  to  the  residual  program,  since  the  procedures  are  unknown.  A  new 
configuration  v'ariable  Wi  represents  the  unknown  value  of  program  vanable  m. 

At  the  end  of  the  Consumer's  loop,  the  variable  i  =  /,  must  be  incremented  by  1.  The  supercompiler  can 
behave  m  different  wavs.  At  simplest,  it  applies  the  general  mle:  when  a  statement  or  an  expression 
cannot  be  evaluated,  a  new  configuration  variable  (say  /,)  is  used  to  represent  its  result  and  an 
assignment  to  it  is  added  to  the  residual  program: 


inti2  =  il+l: 

Then  h  becomes  the  value  of  the  program  variable  Consumer.run.i. 

A  smarter  supercompiler  can  postpone  generating  the  residual  statOTent  and 

value  by  a  symbolic  expression  1,+1.  Uie  class  of  expressions  to  be  manipulated  by  the  supercompiler 


must  be  chosen  by  the  designer  of  supercompiler  for  a  particular  language  and  data  types.  We  plan  to 
take  this  decision  for  Java  after  a  series  of  experiments.  For  integers,  it  seems  reasonable  to  support 
linear  expressions,  ct  x  b,  where  o  and  b  are  integer  ccxistants  and  x  is  a  CMifiguration  variable. 

Figure  7  shows  the  current  configuration  for  the  second  method:  the  value  of  CiMisumer.nm.i  is  /i+l. 

The  supercompiler  continues  transient  driving  of  the  thread  Consumer,  reaches  the  begiiming  of  the  loq) 
ftjr  the  second  time,  and  stqps,  since  the  loop  condition  i<=k  evaluates  to  / i+l  <ku  wiiich  involves 
configuration  variables.  The  current  configuration  does  not  change  and  is  still  as  in  Figure  7. 


Configuration  when  Consumer  and  Producer  start  their  loops  again 

After  the  Consumer  thread  has  been  suspended  by  configuration  variables,  the  supercompiler  switches 
to  the  Producer  thread  and  drives  it  until  it  comes  to  the  beginning  of  the  loop.  On  the  way,  the  program 
variable  Producer .run.i  has  been  incremented  and  has  taken  cm  the  value  /  i+l  just  as  explained  for 
CcMisumer.run.i . 

When  both  loops  restart,  the  current  configuration  looks  like  the  basic  one  in  Figure  4  except  ft>r  the 
values  of  two  program  variables,  Ccmsumer.run.i  and  Producer.run.i,  and  restrictions.  The  values  of 
both  are  /i  in  the  basic  configuraticm  and  ii+l  in  the  currait  configuration.  There  is  no  restriction  in  the 
basic  configuratirai,  and  {h  <  iti}  in  the  current  one.  To  loq)  a  currait  configuraticm  back  to  a  basic 
one,  the  set  of  states  the  current  one  represOTts  must  be  a  subset  of  the  set  of  states  of  ftie  basic  one. 

This  holds  when 

1)  there  exists  a  substitution  mapping  variables  of  the  basic  configuration  to  expressions,  such  that  it 
transforms  the  threads  and  objects  of  the  basic  configuration  to  those  of  the  current  wie,  and 

2)  the  set  of  restrictions  of  the  basic  configuration  is  a  subset  of  the  restrictions  of  the  current  (me. 

In  the  example,  the  first  requirement  is  satisfied  by  a  substitution  {ii  i-^  I'l+l);  the  second  also  holds. 
The  substitution  becomes  an  assignment  statement  of  the  residual  program: 
il  =  iH-l; 

By  adding  the  assignment  and  the  goto  Ml  to  the  residual  program,  the  supercompiler  completes  the 
positive  branch  of  the  loop  condition. 


Residual  program: 

public  static  void  doit(int  k)  { 

//  Initial  configuration. 
il  =  l; 

//  Basic  configuration. 

Ml:  if  (!(il<=kl))  goto  M2; 

int  nl  =  My.readIntO; 
int  ml  =  My.F(nl); 

My.writelnt(ml); 
il  =  il+1; 
goto  Ml; 

//  Configuration  like  the  basic  one  but  with  restriction  {/i  >kx}. 

M2: 

//  ... 

} 
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The  supercompiler  returns  to  the  configuration  on  the  negative  branch  (labeled  M2),  which  was  laid 
aside  after  generating  the  if  statemait,  and  transiently  drives  it  to  the  end  without  adding  any  new 
^atf^mpuTtR  to  the  residual  program.  Superconq)ilation  ends.  The  final  result  is  as  follows. 


Residual  program  : 


// 

// 


public  static  void  doit(int  k)  { 
Initial  configuration. 
il  =  l; 

Basic  configuration. 

Ml;  if  (!(il<=kl))  goto  M2; 

intnl  =  My.readIntO; 
intml  =  My.F(nl); 
My.wTiteInt(ml); 
il=il+l; 
goto  Ml; 

M2; 

} 


Since  Java  has  neither  the  notion  of  label,  nor  goto  statement,  the  residual  program  must  be 
postprocessed.  All  gotos  should  be  either  folded  into  available  Java  control  constructs,  or  procedures 
with  goto  should  be  split  into  several  copies,  each  goto  becoming  an  auxiliary  procedure  call.  The 
absence  of  goto  is  a  serious  drawback  of  a  language  intended  for  program  transformations. 
Unfortunately,  the  authors  of  Java  did  not  think  about  such  use  of  Java. 

Here  is  the  uitimate  residual  program  after  postprocessuig. 


Residual  program: 

public  static  void  doit(iiit  kl)  { 
for  (int  il=l;  il<=kl;  il++)  { 
int  nl  =  My.readIntO; 
intml  =My.F(nl); 
My.writelnt(ml): 

} 
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Conclusions 


The  example  has  demonstrated  almost  all  techniques  of  supercompilation  which  we  plan  to  implement  in 
the  first  version  of  the  Java  supercompiler.  Most  are  general  methods;  a  few  are  specific  fiar  Java  (loop 
syntax): 

1.  The  notion  of  configuration  -. 

•  A  configuration  is 

•  z  set  of states  of  the  original  program,  as  well  as 

•  a  program  point  of  the  residual  program; 

•  Configuration  variables  are 

•  parameters  of  a  configuration ,  as  well  as 

•  program  variables  of  a  residual  program; 

•  The  configuration  structure  resembles  the  representation  of  program  state  in  the  Java 
interpreter  and  consists  of 

•  a  set  of  threads,  and 

•  a  set  of  objects; 

•  Additional  elements  in  configurations  but  not  interpretation  structures  are 

•  configuration  variables  occurring  instead  of  unknown  values  or  the  unknown 
recursive  tail  of  an  object, 

•  restrictions,  v^Wch  are  predicates  on  configuration  variables, 

•  the  tog  for  objects: 

•  Each  initial  configuration  supplied  by  a  user  generates  a  task  for  supercompiiation. 

2.  The  process  of  driving: 

•  transient  driving:  threads  are  executed  as  ifby  an  interpreter  whoiever  configuration 
variables  do  not  interfere.  The  results  of  some  operations  are  evaluated  even  if  they  depend 
on  configuration  variables  and  are  represented  by  symbolic  expressions. 

•  residualization  of  an  unconditional  statement:  when  a  statement  cannot  be  executed  because 
it  depends  upon  configuration  variables  or  unknown  procedures,  a  copy  of  it  is  put  into  the 
residual  program.  Program  variables  in  the  copy  are  replaced  by  their  values,  possibly 
involving  configuration  variables.  If  needed,  an  unknown  result  of  evaluation  of  an 
expression  is  represented  by  a  new  configuration  variable  if  it  cannot  be  represented  by  a 
symbolic  expression. 

•  branching  {residualization  of  a  conditional  statement):  wdien  the  condition  in  an  if  statement 
cannot  be  evaluated  to  true  or  false,  it  is  residualized  and  the  process  of  supercompiiation 
continues  along  one  of  the  branches,  usually  the  positive  one  first.  When  supercompiiation 
of  the  first  branch  has  completed,  the  second  branch  is  processed.  Other  conditional 
statements,  for,  while,  case,  are  compiled  according  their  semantics  by  reduction  to  if 

•  next  configuration  is)  after  residualization  is  (are)  at  worst  the  same  as  the  one(s)  before. 
However,  any  additional  information  revealed  during  a  step  is  represented  in  the 
configuration,  improving  the  residual  program.  Driving  rules  for  each  Java  operator 
algorithmically  define  how  a  configuration  is  narrowed  after  a  step  by  one  of  two  means: 

•  contraction  of  configuration  variables:  replacing  these  by  constants  or  symbolic 
e>q)ressions  involving  other  or  new  configuration  variables; 
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.  aHHing  an  elementary  predicate  to  the  set  of  restrictions  ofthe  current  configuration. 
A  class  of  elementary  predicates  is  to  be  chosen  by  the  author  of  a  supercompiler . 

For  Java,  we  plan  to  use  inequalities,  ^y<,  ^ 


3.  Configuration  analysis: 

•  The  goal  of  supercompilation  is  to  construct  ti  finite  residual  program^  To  adueve  &is,  a 
supercompiler  analyzes  the  trace  of  driving,  compares  the  current  configuration  with 
previous  ones,  and  takes  the  following  decisions: 

•  looping  back:  when  the  current  configuration  is  a  subset  of  an  old  one  and  can  be 
reduced  to  it  by  a  substitution,  appropriate  assignment  statements  reflectmg  toe 
reduction  and  a  goto’  statement  are  put  into  toe  residual  program.  Additional  rules 
may  preclude  looping  back  in  order  to  construct  a  more  efficient  speciahzed 
program. 

•  generalization:  when  toe  supercompiler  finds  an  old  amfiguration  which  has 

threads  and  objects  with  toe  same  structure  as  toe  current  configuration,  ^cept  for 
variable  values,  it  throws  away  toe  part  of  toe  residual  program  startmg  from  toe 
old  configuration,  builds  the  least  common  generalization  of  toe  configurations  ^d 
makes  it  next  after  toe  old  one.  TTie  corresponding  assignments  to  the  new  vanables 
ofthe  generalized  configuration  are  put  on  toe  arc  from  toe  old  to  toe  generalized 
configuration.  Then  toe  supercompiler  reconstructs  toe  residual  program  startmg 
from  toe  generalized  confi^ration.  ITiis  back-tracking  is  toe  mam  reason  that 
supercompilation  is  often  time-consuming. 

•  cMm>7g®  the  curr«it  or  one  ofthe  previous  configurations  mto  two.  This 

corresponds  to  a  procedure  call  in  toe  residual  program.  The  two  parts  are 
supercompiled  s^arately. 


To  determine  panicular  configurations  to  loop  back,  to  generalize  or  to  cut  :s 
problem  of  supercompilation.  As  a  first  approximation,  we  will  use  the  method  descnbed  . 
^urchin  88]  as  modified  for  multiple  threads.  Its  idea  is  that  only  toe  ^cture  of  threads, 
not  the  values  bound  to  program  variables,  is  used  to  make  decisions.  This  method 
guarantees  termination  of  supercompilation. 


4.  User  control:^  Even  if  the  supercompiler  is  very  clever,  it  is  just  a  machme  and  the  human  user  ^n 
substantionally  help  it  to  catch  basic  configurations,  thus  either  improving  toe  resi  ua 
drastically  decreasing  supercompilation  time.  However,  this  is  not  a  task  for  toe  end-user  w 
taows  nc^ing  about  supercompilation,  and  is  just  a  programmer  usmg  Java 
write  an  application  in  Java.  Authors  of  libraries  of  reusable  software  must  tomk  about  program 
transformations.  We  plan  to  develop  toe  means  to  annotate  library 
information  for  the  supercompiler.  TTie  supercompiler  can  be  given  some  kind 
oreferable  classes  for  basic  configurations.  This  may  mclude  source  program  pomts,  which  may 
"X^ncornp^lerto  trace'and  compare  configurations,  or  to  perform  some  actions  suggested 

by  toe  user, 

5  Uirarm  of  reusable  software:  Supercompilation,  as  well  as  other  program  transfermatiim 

tlTies.  allows  new  meAods  for  stmomring  progranrs,  in  partroular,  W  aotwe  o  ^e^- 
In  order  to  bring  supercompitotion  into  praaice,  we  plan  to  develop  some  demo  hbianes,  mcludmg 
one  to  support  toe  construction  of  data  base  applications 


"  Postprocessor  will  be  used  to  fold  gotos  into  the  Java  loop  constructs  or 
®  This  was  not  demonstrated  by  the  e.\ample. 


to  translate  to  auxiliary  function  calls. 
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Abstract 


An  extension  of  Java,  called  pJava,  which  allows  for  automatic  parallelization  and  efficient 
program  transformation  is  presented.  The  idea  of  pJava  is  to  select  a  Java  subset  with  suitable 
properties,  and  then  gradually  extend  it  whUe  preserving  the  properties.  The  starting  point  of 
pJava  is  a  purely  functional  subset  of  Java  with  data  limited  to  immutable  objects.  Higher- 
level  declarative  notions  will  gradually  be  added  to  allow  for  easy,  reliable,  implicitly  parallel 
programming.  Ultimately  mutable  objects  will  be  allowed  as  well,  but  under  a  special 
programming  discipline.  This  discipline  can  be  satisfied  at  a  low-level  by  a  qualified  Java 
programmer,  or  at  a  high-level  via  constructs  based  on  monotone  objects,  which  can  be  easily 
applied  by  less-experienced  users. 
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introduction 


We  propose  to  develop  a  language  based  on  Java  for  easy  parallel  programming  in  networks.  We  call  it 

pJava.  The  main  reasons  and  goals  for  this  work  are  as  follows. 

Why  Java  as  the  basis? 

•  Java  is  rapidly  becoming  a  wide-spread  language  used  by  millions  of  programmers.  A 
language  that  has  as  many  features  in  common  with  Java  as  possible  will  be  easier  to  learn 
and  to  use.  For  the  same  reason  the  Java  authors  based  their  work  on  C, 

•  Java  already  has  good  basic  properties  for  automatic  parallelization  and  program 
transformation.  Its  authors  have  done  a  good  job  of  taking  C  and  removing  its  poorly  defined 
parts.  In  particular,  the  Java  data  model  is  good 

Why  not  Java?  Why  a  new  dialect? 

•  Java  is  a  low-level  language.  It  was  not  intended  for  automatic  parallelization.  It  has  an 
explicit  method  for  parallel  programming,  that  is,  the  notion  of  threads. 

•  The  level  of  pJava  will  be  higher  than  that  of  Java.  Programs  in  pJava  will  be  suitable  for 
automatic  parallelization,  both  static  (at  compile  time)  as  well  as  dynamic  (at  run  time).  The 
method  for  parallel  programming  will  be  implicit  and  simpler  to  port  to  new  computers. 

What  is  the  relation  of  pJava  to  Java? 

•  First,  we  select  an  almost  universal  subset  of  Java,  which  supports  program  transformation 
and  is  suitable  for  automatic  parallelization.  The  subset  is  appro.ximaiely  a  single-assignment 
functional  language.  It  is  value-oriented  rather  than  object-oriented.  Only  immutable^ 
objects  are  allowed  to  model  complex  values. 

•  Second,  we  extend  it  with  high-level  constructs,  which  do  not  interfere  with  the  goal  of 
automatic  parallelization.  Extensions  are  done  in  cycles,  gradually  introducing  new  notions: 

•  Value-oriented: 

•  Notions  from  functional  programming,  including  recursive  data  definitions 
of  the  form  x  -  f  (x) ,  the  semantics  of  wWch  requires  lazy  evaluation. 

•  Higher  le\^el  declarative  notions  from  specific  applicative  areas,  especially 
based  on  e.xperience  from  the  Norma  project  [Zadykhailo  et  al  96] 

•  Object-oriented: 

•  Restricted  Java  classes,  which  allow  automatic  parallelization.  These  are 
referred  to  as  monotone  objects  [Klimov  91]. 

•  Other  Java  means  (e.g.  threads  and  arbitrary'  non-monotone  classes)  will  not  be  prohibited. 
These  are  regarded  as  low-level,  and  not  recommended  for  ordinan^  users,  but  allowed  (and 
actually  needed!)  for  experienced  programmers.  The  pJa\'a  s>*siem  will  contain  an  anah'zer 
that  will  armotate  a  program  b>^  telling  what  parts  of  a  program  belong  to  “true”  higher-level 
pJava.  and  what  are  lower-le\'el.  This  is  not  just  pa>ing  tribute  to  Java.  It  is  our  strong 
opinion  that  a  language  containing  only  high-le\'el  notions  would  be  impractical.  Moreover, 


^  An  object  is  immutable  if  it  does  not  change  after  a  constructor  has  initialized  it.  This  is  the  defining 
property  of  a  class  that  can  be  checked  by  the  pJava  system. 


the  idea  of  a  monotone  object  substantially  requires  that  infinitely  many  monotone  objects  be 
programmed  by  low-level  means. 

What  is  the  background  of  our  research  into  automatic  parallelization  of  pJava? 

•  Traditional  methods  of  automatic  parallelization  based  of  functional,  single  assignment  data 
flow  languages. 

•  The  experience  in  parallelization  of  higher-level  declarative  languages  (like  Norma 
[Zacfykhailo  et  al  96]). 

•  Our  experience  in  constructing  monotone  object  classes,  which  allow  for  automatic 
parallelization  and  dynamic  program  transformation  [Klimov  91]. 

•  Achievements  in  deep  program  transformation  disciplines  such  as  partial  evaluation  [Jones  et 
al  93]  and  supercompilation  [Turchin  86],  and  our  results  in  applying  these  to  Java. 

What  are  the  characteristic  features  of  the  approach? 

•  Implicit  parallelism  of  higher-level  language  notions  for  most  users,  and  explicit  notions  for 
parallel  programming  only  for  experienced  programmers,  creating  low-level  parts  of  libraries 
of  reusable  software. 

•  Development  of  high-level  languages,  up  to  declarative,  for  particular  application  areas,  as  an 
extension  of  a  common  language,  Java.  High-level  language  notions  are  much  better 
parallelized  and  transformed  than  low-level  ones. 

•  Easy  parallel  programming  bv’  the  masses  as  the  main  goal.  This  goal  is  the  reason  for  our 
preference  of  high-level  notions.  If  efficiency  and  simpliciU'  of  use  conflict,  we  prefer  to 
sacrifice  some  efficiency  to  retain  ease  of  use.  keeping  in  mind  that  computer  speeds  almost 
double  each  year. 

•  Our  approach  differs  from  that  of  languages  such  as  HPF(ortran),  HPJava  [PCRC  96],  which 
are  based  on  e.xtending  a  language  by  explicit  low-level  parallel  methods,  intended  for 
experienced  programmers. 

•  Use  of  modem  program  transformation  techniques  like  supercompilation  and  partial 
evaluation  to  optimize  programs  deeply  and  to  support  automatic  parallelization.  A  motto. 
“Language  properties  required  for  automatic  parallelization  and  efficient  program 
transformation  are  the  same.”  Languages  well-suited  for  one  goal  are  well-suited  for  the 
other  as  well.  Hence,  our  simultaneous  development  of  a  language  for  both  goals. 

•  Optimization  of  parallel  programs  not  only  by  parallelization,  but  by  ^sequencialization  ”  as 
well,  that  is.  ly'  fusing  threads  and  parallel  processes  to  form  smaller  numbers  of  threads  and 
processes  with  fewer  delays  for  interactions.  We  consider  pJava  a  high-level  language  with 
highly  parallel  interpretive  semantics.  To  improve  its  programs,  we  either  e.xtract  this 
implicit  parallelism  by’  converting  the  interpreter  to  a  compiler  and  mapping  it  onto  a  modem 
computer  architecture,  or  reduce  inefficient  concurrency  by'  fusing  threads. 

•  Constraction  of  demo  libraries  of  reusable  software  (for  example,  for  data  base  applications), 
making  use  of  our  methods.  To  bring  the  new  methods  into  practice,  the  authors  will  develop 
programs  that  demonstrate  the  value  of  the  methods. 
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pJava  Language  Structure 

pJava  is  a  superset  of  Java  consisting  of  two  parts: 

•  A  high-level  part,  which  contains  no  explicit  parallel  notions  and  is  intended  for  easy  programming 
and  automatic  parallelization.  It  is  based  on  a  suitable  Java  subset  and  then  is  gradually  extended 
preserving  the  required  properties.  Call  this  “pJava  proper”. 

•  A  low-level  part,  which  is  a  subset  of  Java,  which  contains  explicit  notions  for  parallel  programming. 

Together,  the  low-level  and  high-level  parts  cover  all  of  Java.  Actually,  the  low-level  part  may  be  defined 
as  containing  all  Java  notions  except  those  belonging  to  the  high-level  part.  Hence,  the  user  need  not 
limit  himself  to  pJava  proper.  A  preprocessor  will  annotate  each  program  telling  which  parts  belong  to 
the  high  and  low  levels.  Such  a  preprocessor  will  not  only  help  users,  but  also  is  part  of  an  automatic 
parallelization  tool  for  high-level  parts  of  programs. 

The  high-level  parts  of  pJava  consist  of  se%’eral  layers  of  notions,  each  more  complex  than  the  previous 
one  and  relying  on  it.  The  layers  of  pJava  are  split  into  two  parts:  value-oriented  and  object-oriented. 


Value-oriented  levels 

The  key  idea  of  the  value-oriented  part  is  that  mutable  objects  are  prohibited.  The  values  of  data  entities 
cannot  change.  Although  objert-oriented  Ja\'a  notions  such  as  classes  and  inheritance  are  present  at  these 
levels,  they  are  used  only  to  model  values,  that  is,  immutable  data. 

Definition.  A  class  and  its  objects  are  immutable  if  only  its  constructor  sets,  or  otherwise  changes,  the 
value  of  its  local  variables,  and  local  objects  are  also  immutable. 

Statement  There  exists  a  Java  program  analyzer  that  can  check  whether  a  class  is  mutable  or  not. 

Note.  The  final  tj-pe  modifier  of  Java  supports  the  notion  of  immutable  objects,  and  allows  the  user  to 
supply  additional  information  for  the  anah’zer. 

1®*  level 

First,  we  select  a  subset  of  Java,  that  approximately  corresponds  to  afunctional  language,  and  allows  for 
automatic  parallelization. 

•  Data:  Simple  Java  types  (numeric,  characters.  Boolean)  and  immutable  classes.  No  arrays.  Only  tree 

structures  can  be  constmaed  these  means;  not  graphs. 

•  Control:  Procedures  programmed  in  single-assignment  style.  No  threads.  Exceptions  are  allowed 
under  certain  conditions. 

2"“  level 

Second  we  change  and  extend  the  semantics  of  the  I**  level,  preserving  the  syntax. 

Statement  The  level  allows  for  the  following  changes  from  sequential  to  parallel  semantics, 
preserving  the  upwards  equivalence  of  the  result  of  comjjutation: 

•  Each  class  is  modified  to  be  a  subclass  of  a  special  class  isReady  that  controls  an 
additional  tag  for  each  variable  telling  whether  its  value  is  ready,  or  not.  After  a  variable  has 
been  created  the  tag  equals  not  ready.  After  the  variable  takes  its  value  for  the  first  time,  the 
tag  changes  to  ready.  After  this,  the  value  does  not  change. 
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•  Class  IsReady  has  procedures,  put  and  get,  which  are  synchronized  (in  Java  terms); 
get  waits  for  ready,  put  sends  ready  signals  after  it  has  assigned  a  value  to  a  variable. 

•  Each  procedure  call  is  executed  by  starting  a  new  thread. 

Statement  The  extension  of  the  semantics  is  a  proper  extension: 

•  If  a  program  returns  a  result  by  the  sequential  semantics,  it  returns  the  same  result  by  the 
parallel  semantics. 

•  If  a  program  returns  a  result  by  the  parallel  semantics,  it  may  execute  infinitely  or  reach 
deadlock  under  sequential  semantics. 

Thus,  we  come  to  the  first  extension  of  Java,  which  we  consider  the  2"^  level. 

Exploiting  this  semantics  extension,  new  programs  may  be  written  to  define  complex  data  by  means  of 
recursive  equations  of  the  form  x  =  /(x) ,  expressed  textually  as  an  ordinary  assignment.  For  example, 
consider  an  immutable  class  to  represent  integer  lists,  IntList: 
class  IntList  { 


}; 

class  IntNil  extends  IntList  { 
IntNilO  {); 

}; 

class  IntCons  extends  IntList  { 
int  head; 

IntList  tail; 

IntCons (int  head,  IntList  tail)  ( 
this. head  =  head; 
this. tail  =  tail; 


} 

Then  assignments  of  the  following  form  make  sense: 

IntList  X  =  F(x); 

For  example,  an  F  which  defines  x  to  be  a  list  of  natural  numbers  from  1  to  100  can  be  coded  as  follows: 

IntList  F (IntList  x)  { 

return  new  IntCons (1,  x.incO .take(99) ) ; 

1 

where  the  methods  inc  (increment  all  integers  in  a  list  by  1)  and  take  (take  first  length  elements  of  a 
list )  are  declared  in  the  class  IntList: 

class  IntList  { 

Int.List  inc()  { 
return 

this  instanceof  IntNil  ? 
this 

:  new  IntCons ( { (IntCons) this) .head+1, 

( ( IntCons ) this ) . tail . inc ( ) ) ; 

} 

IntList  take (int  length)  { 
return 

this  instanceof  IntNil  I  I  length==0  ? 

(IntList)  new  IntNilO 
:  new  IntCons (( (IntCons) this) .head, 

(  (IntCons)  this)  .tail,  take  (length-1)  ; 

} 

1 
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The  procedures  inc  and  take  can  execute  in  parallel  because  of  the  extension  of  the  semantics.  Old 
Java  syntax  is  used.  No  new  syntax  notions  are  needed. 

3"^  level 

Here  lies  a  large  world  of  declarative  languages,  e.g.,  functional,  logical,  and  constraint  languages.  First 
of  all,  we  will  develop  a  Norma-like  extension  of  Java. 

The  value-oriented  part  of  pJava  will  not  allow  construction  of  arbitrary  data  structures.  The  most 
complex  structures  that  can  be  represented  using  tail  recursion  on  immutable  objects  are  trees  and  their 
simpler  degenerate  forms,  such  as  linear  lists.  This  limitation  gives  us  a  rational  reason  to  include 
mutable  data  in  further  extensions  of  pJava.  Their  use  is  not  just  a  matter  of  improving  efSciency,  as  in 
the  usual  arguments  against  the  use  of  functional  languages.  Mutable  data  are  necessary. 


Object-oriented  level 

Fortunately,  a  large  number  of  classes  exist  which  can  be  programmed  by  low-level  means  using 
synchronized  threads,  but  preserve  all  properties  essential  for  program  transformation  when  used  at 
the  value-oriented  level.  These  classes  (and  their  objects)  are  referred  to  as  monotone  [Klimov  91]. 

The  notion  of  a  monotone  class 

Definition.  A  set  of  classes  are  called  monotone  if  any  Java  program  that  uses  the  classes  and  is  written 
so  as  to  satisfy  the  restrictions  of  the  value-oriented  level  meets  the  following  properties; 

1.  determinacy:  evaluating  procedures  in  arbitrary  order  according  to  parallel  semantics  gives 
the  same  result; 

2.  recomputabiUtv:  if  a  procedure  is  e\'aluated  with  the  same  arguments  for  a  second  time,  it 
returns  an  equivalent  result,  and  produces  no  new  side  effects. 

The  first  propert\-  means  that  equivalent  parallel  codes  can  exist.  It  makes  automatic  parallelization 
possible.  The  second  propert\-  allows  reliable  computations;  if  a  processor  on  which  a  process  was 
scheduled  faik  the  process  may  be  rescheduled  and  started  from  the  beginning.  Our  research  into 
supercompilation  has  shown  that  the  second  property  is  very  important  for  effective  program 
transformations  as  well. 

An  example  of  a  monotone  class 

The  following  class  defines  objects  which  behave  like  variables  with  values  that  can  be  assigned  only 
once.  It  is  the  simplest  monotone  example. 

class  IntVar  { 
int  value; 

boolean  ready  =  false;  .  _  t 

synchronized  void  put (int  n)  throws  ContradictoryAssignmen  { 
if  (ready  &&  value  !=  n)  throw  ContradictoryAssignment; 
value  =  n; 
ready  =  true; 
notify ( ) ; 

} 

synchronized  int  get (int  n)  { 
if  (! ready)  wait(); 
return  n; 

1 
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Monotone  object-oriented  programming 

In  a  similar  way  a  librarv  of  basic  monotone  classes  can  be  constructed  These  use  low-level  mechanisms 
like  synchronized  arid  boolean  ready.  The  must  be  written  by  a  qualified  programmer.  However, 
when  one  defines  higher-level  objects  by  using  only  monotone  objects  as  building  blocks  and  no  explicit 
thread-oriented  notions  are  used  the  new  objects  are  automatically  monotone.  Having  libraries  m 
monotone  functions  will  allow  complex  application  programs,  which  are  parallel  by  construction,  to  be 
written  by  ordinary  Java  programmers. 

Monotone  classes  allow  construction  of  arbitrary  complex  data,  up  to  graphs.  Our  experience  has  shown 
that  although  monotone  objects  do  not  allow  arbitrary  change  of  state,  reactive  and  dialogue  systems  can 
be  programmed  using  a  small  number  of  basic  monotone  objects  that  are  programmed  by  low-level  means. 
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Conclusions 


We  intend  to  use  this  sequence  of  extensions  to  develop  pJava  from  Java.  It  will  be  a  flmctional  language 
extended  to  include  at  least  write-once  semantics  for  data  variables,  but  will  retain  the  familiar  syntax  of 
ordinary  Java. 

Programs  based  on  the  functional  subset  of  pJava  will  allow  program  transformation  and  automatic 
parallelization  techniques  to  be  used  to  form  faster  running  versions  of  the  code  for  specific  purposes.  For 
example,  a  fast  code  for  multiplying  100x100  matrices  can  be  derived  from  a  generic  library  routine  for 
multiplying  NxN  matrices.  The  routine  can  easily  be  ported  for  efScient  execution  on  any  machine  for 
which  a  pJava  interpretation  or  compilation  system  exists. 

Writing  programs  in  pJava  will  also  allow  graphical  composition  of  pJava  library  routines  to  form  fest 
running  parallel  Solutions  for  the  specific  application  needs  of  masses  of  users.  Whatever  code  is 
produced  by  composition  of  standard  pJava  routines  can  be  optimized  to  run  faster,  at  least  by  eliminating 
the  parts  of  the  combined  routines  that  are  not  used  in  a  particular  composite  calculation.  This  need  will 
grow  as  more  and  more  people  want  to  use  the  Internet  to  find  fast  answers  to  their  own  questions. 
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1  Technical  background  of  the  NORMA  language  creation. 

1.1  The  first  works. 

The  first  publications  appeared  in  late  50s  -  early  60s  [1,2].  The  main  idea  of  the  language  later 
named  NORMA  is  very  simple.  It  was  an  attempt  to  automate  the  design  of  the  programs  based  on 
the  jobs  prepared  by  applied  mathematicians  from  Keldysh  Institute  of  Applied  Mathematics  for 
further  programming.  Usually  those  jobs  were  the  result  of  applying  numerical  methods  (more  often 
grid  method)  to  physical  problems’  solution.  The  intention  was  to  create  the  language  of  jobs' 
specification  corresponding  to  the  constructions  obtained  after  mathematical  solution  of  the  problem. 

At  first  the  authors  called  such  method  of  specification  “parameter  record"  later  they  used 
term  “nonprocedural  specification"  nowadays  term  “declarative  specification"  is  often  in  use. 

Having  analysed  the  material  on  this  subject  the  authors  came  to  the  conclusion  that  analysis 
of  index  dependencies  appeared  in  the  relations  between  variables  when  continual  equations  are 
discretised  would  be  the  fact  of  great  importance  in  designing  program  on  declarative  specification. 

Let’s  consider  briefly  the  first  attempt  to  analyse  such  relations.  There  settled  the  task  to 
analyse  specifications  of  the  following  type: 

Xi  = 

Xi  —  f  (.ATi-Aj  , ,  1-A2.J  ,■■■■>  AiTi-Ajj, )  ^-l  J 

It  should  be  noted  that  the  problem  of  parallelism  wasn’t  urgent  at  that  time  because  there 
were  no  proper  computers  that’s  why  they  solved  the  problem  of  automatic  design  of  the  loop  process 
for  computations  set  by  relations  (1).  One  can  see  that  a  relation  is  a  construction  where  the  variable 
with  indices  is  placed  in  the  left  part  and  the  function  from  the  variables  with  indices  in  the  right  part. 
In  this  particular  case  the  variables  have  only  one  index  and  displacement  A;.;  k,l=\,...,n  is  an 

integer  constant.  General  form  of  relations  will  be  considered  below. 

p 

Let’s  consider  that  the  values  of  variables  Xi  /?  =  1, . . .  are  to  be  calculated  for  the  value 
of  one-dimensional  domain  set  by  integer  numbers  in  the  ranges  from  /w®(/;)  to 
m°{p)»m^{p),  p  =  \,...,n,  /w°(/7)  -  boundaries  of  the  calculated  values.  Let’s  also 

assume  that  all  the  values  to  the  left  of  fn^{p)  are  known.  The  last  assumption  is  lies  in  the  fact  that 
a  full-connected  graph  corresponds  to  the  dependencies  between  the  variables.  The  arcs  of  the  graph 

are  considered  to  connect  the  graph’s  node  X  with  node  X  if  X  is  in  the  left  part  of  the  relation 

'  ... 
and  X  is  in  the  right  one.  We  consider  the  full-connected  graph  because  it  has  many  difficulties  in 

solving. 

Matrix  H  with  elements  h'ip,q)  =  m’(q)-m'(p)  +  Aip,q),  p,q=l,...,n  is  the  main 
object  the  method  of  solution  is  based  on.  The  variables  with  index  /  are  the  current  state  of  the 
boundaries  after  step  t .  The  values  of  the  components  which  have  the  arguments  (i.e.  they  have 
been  already  computed  or  set  as  the  initial  values)  are  assumed  to  be  calculated  at  every  step.  Note 

p 

that  h‘{p,q)  is  the  number  of  components’  values  which  could  be  computed  for  variable  X  if  it  is 
depended  only  ox\  X . 

Let’s  give  name  “chain”  to  any  sequence  from  k  numbers,  k<n  where  all 

are  different.  The  sequence  of  the  following  type  from  the  elements  of  matrix  H  may  be  built. 
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Let’s  name  sum  of  all  the  elements  of  chain  z  weight  p'{z)  of  this  chain  and  variable 

S'{z)  =  p\z)lk  reduced  weight.  It  is  easy  to  see  that  p‘  and  S‘  for  every  chain  don't  depend  on 
the  system  of  the  boundaries  and  determined  by  the  index  displacements  only: 

k  ~  k 

Corresponding  methods  of  research  in  organising  the  loop  process  of  computations  is  developed  in 

PI- 

Let's  generalise  the  results  obtained  in  [2].  Let  the  chain  with  the  minimal  reduced  weight 

be  found  (an  algorithm  of  determination  of  chain  is  given  in  [2]  and  has  estimate 

crP  logj  n,  c  -  constant)  then  the  following  theorem  is  correct: 

Theorem.  If  the  minimal  reduced  weight  of  the  chain  from  (1)  S^^<0  then  the  computation  is 

impossible.  If  Si^>0  then  the  computations  may  be  organised  in  the  way  that  S^  n  values  of  the 

variables  on  the  average  could  be  computed  at  every  step. 

Here  the  term  “on  the  average”  is  used  in  the  sense  of  the  possibility  of  computing  different 

numbers  of  the  variables  at  different  steps  in  the  case  of  non-integer  but  np  values  of  the 

variables  are  to  be  computed  for  k  sequential  steps. 

Corollary  1.  Mere  sequential  computation  appears  in  the  case  when  is  reached  on  the 

chain  with  the  length  n  and  weight  1,  i.e.  in  the  system  with  the  minimal  reduced  weight  S^  =  \/n. 

Corollary  2.  Concurrent  computation  of  every  variable's  values  at  step  k  is  possible  when 
S,>1. 

Corollary  3.  If  >  1,  is  integer  then  5^  values  of  every  variable  can  be  computed  at 
very  step.  The  regular  computation  similar  to  this  one  may  be  organised  neglecting  the  fractional  part 
if  isn’t  integer. 

It  should  be  noted  that  though  the  results  given  above  follow  simply  from  wor^  [2]  they  are 
original  and  show  the  influence  of  index  dependencies  on  the  possibilities  of  parallelising.  Authors 
consider  the  research  done  on  this  grant  and  obtained  results  very  useful. 

Given  methods  didn't  have  any  influence  on  the  design  of  the  translator  from  the  language  of 
declarative  specification  as  A.N.  Andrianov  constructs  general  method  of  computation  process 
organization  (parallel,  sequential,  parallel-sequential)  for  the  systems  of  relations  in  rather  general 
form  (multidimensional  case  for  the  computational  domains  with  complicated  configuration  is 
considered).  Theoretical  results  and  program  implementation  of  some  algorithms  based  on  these 
results  have  been  obtained. 

One  more  note  should  be  done.  It  is  clear  from  the  task's  statement  that  we'll  consider 
extracting  natural  parallelism  represented  in  the  relations.  We  aren't  going  to  use  any  artificial 
methods  dealt  with  e.g.  transformation  of  the  program.  E.g.  it  is  generally  known  that  the  elements  of 
some  tasks  can  be  reduced  to  the  form: 

where  the  values  of  a,  and  h.  on  grid  [1,;;]  and  initial  value  are  assumed  to  be  set.  vv  is  to  be 
computed  on  grid  [1,//].  The  computation  by  our  methods  will  be  evidently  sequential.  Really  here  n=1 
and  =  1  so  the  only  one  value  will  be  computed  at  every  step. 

But  it  is  well-known  that  there  exists  method  of  parallelising  such  a  relation.  Authors  never 
refuse  to  include  such  methods  and  even  parts  of  the  programs  in  other  languages  into  translating 
system.  Further  more  the  problem  of  including  Norma  extracts  into  other  languages  (we  mean 
FORTRAN  versions)  should  be  discussed. 

The  authors  also  faced  the  problem  of  limiting  the  capabilities  for  parallelising  (the  problem  of 
parallelism  granularity)  or  imposing  the  constraints  on  the  order  of  computations  (e.g.  caused  by  the 
requirements  to  the  precision  of  computations). 


4 


To  conclude  this  part  we  should  note  that  its  aim  was  to  report  some  details  about  the  history 
of  the  approach  to  the  NORMA  language  design  and  to  explain  principal  notions  and  problems 
appearing  in  the  solution  process  on  simple  examples. 

1.2  The  advantages  of  the  approach  based  on  the  NORMA  language  application. 

1.2.1  Extremely  high  level  of  the  language.  New  level  of  stability 

We  have  gained  much  experience  in  designing  complicated  program  systems  and  translators,  now  it 
is  time  to  create  really  friendly  programming  languages.  We  shall  take  a  decisive  step  and  turn  from 
universal  languages  to  the  languages  for  users  to  formulate  problem  solution  in  generic  terms.  We 
are  sure  that  universal  languages  may  be  friendly  only  to  system  programmer.  Hence  we  shall  bend 
every  effort  to  creation  of  specialized  language  for  each  application  domain. 

It  is  clear  that  such  a  language  will  be  changed  as  soon  as  the  application  domain  is 
changed.  So  extreme  level  of  the  language  provides  new  level  of  the  language  stability.  Let’s  note 
that  the  languages  of  universal  type  are  constantly  being  specified  to  get  a  new  level  of  conveniences 
in  programming  or  to  consider  new  computer  resources.  Constant  modernisation  of  the  FORTRAN 
language  is  a  typical  example.  The  idea  of  creating  specialized  languages  has  been  known  for  a  long 
time  but  it  is  time  to  pay  much  more  attention  to  this  approach.  We  consider  this  subject  to  become 
first  and  foremost  in  the  programming. 

We  know  by  experience  that  terminology  (languages)  used  by  technical  experts  in  the 
complex  application  domains  gives  us  a  good  opportunity  to  consider  the  peculiarities  of 
computational  environment.  The  systems  of  notions  in  the  application  domain  has  been  always  based 
on  the  mathematical  notions  and  it  explains  this  phenomenon.  It  is  worth  to  mention  that  there  is  no 
notion  “memory”  in  such  languages  hence  there  is  a  principle  of  single  assignment. 

1.2.2  Portability 

As  the  specification  in  the  NORMA  language  contains  full  specification  of  an  algorithm  and  doesn’t 
reflect  any  peculiarities  of  the  computer.  It  may  be  implemented  on  the  computer  with  any 
architecture  both  sequential  or  parallel.  Synthesising  translator  must  be  automatically  adapted  to  the 
peculiarities  of  the  architecture  or  must  ailow  for  the  peculiar  architecture. 

1.2.3  New  level  of  programming  reliability 

As  the  user  is  free  from  the  necessity  of  making  a  program  (this  part  is  carried  out  by  a  translator) 
then  such  process  of  programming  doesn’t  cause  mistakes  (up  to  the  reliability  of  a  translator). 
Further  more  besides  the  traditional  syntactical  and  semantic  diagnostics  the  synthesising  translator 
can  send  messages  about  the  errors  in  the  essential  notions.  E.g.,  impossibility  of  organising 
computation  caused  by  insufficient  initial  data,  by  the  mistakes  in  the  index  displacements  etc. 

1.2.4  Efficiency  of  the  programs 

High  efficiency  of  automatically  designed  program  is  based  on  the  capability  of  deep  parallelising  and 
providing  the  necessary  level  of  parallelism  granularity.  Generally  speaking  the  synthesising 
translator  has  the  capability  of  estimating  the  different  variants  of  the  representation  of  the  declarative 
specification  in  the  program  and  finding  of  the  best  one  according  to  the  built-in  rule  or  in  the  dialogue 
with  the  system  programmer  or  the  user  himself. 
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1 .3  Particularising  the  statements  on  the  example. 

There  some  characteristics  of  the  language  only  in  connection  with  the  example  given  below  will  be 
briefly  described.  More  details  about  the  language  and  the  state  of  work  are  given  in  the  next  part. 

The  example  given  below  is  taken  from  the  article  [4]  where  the  author,  Lesly  Lamport  uses 
this  example  to  explain  the  method  of  hyperplanes  applied  to  the  automated  parallelising  of  loops: 

DO  99  l=l,L 
DO  99  J=2,M 

DO  99  K=2,N  (2) 

U(J,K)*(U{J+1,K)+U(J,K+1)+U(J-1,K)+U(J,K-1))*0.25 
99  CONTINUE 

This  extract  is  a  simplified  variant  of  difference  scheme  for  solving  the  task  of  Laplace  on 
rectangular  grid  iJ.K)  by  fixed  number  of  iteration  on  /.  The  constraints  imposed  by  this  construction 
on  the  computations’  order  require  the  analysis  of  the  computations’  order  (which  is  set  by  the  nesting 
of  loops),  the  analysis  of  variables’  values(which  is  necessary  because  of  the  reassignment  occurred 
in  carrying  out  the  program),  and  also  the  analysis  of  index  dependencies.  All  these  facts  must  be 
taken  into  consideration  for  the  equal  results  both  in  parallel  and  sequential  variants. 

In  the  result  of  the  analysis  we  can  state  that  the  values  of  the  variables  having  index 
displacements  M  and  K■^  are  taken  from  iteration  /  and  the  rest  ones  from  iteration  M  in  the 
computation  of  the  values  in  some  point  of  3-dimensional  space  with  coordinates  {fJJC)  (though  only 
plane  (JJC)  is  set  explicitly,  variable  I  introduces  one  more  coordinate  implicitly).  All  the  values  of  U 
are  considered  to  be  known  on  all  the  boundaries  and  independent  of  I.  Thus  at  the  first  step  only  the 
values  of  variable  U(2,2)  when  7=1  can  be  calculated  ;  at  the  second  step  -  U(2,3)  and  U(3,2) 
when  /  =  1 :  at  the  third  -  U(2,4),  U(4,2),  U(3,3)  when  /  =  1  and  U(2,2,)  when  7=2.  They  proved 

that  there  can  be  found  the  family  of  the  planes  where  the  values  of  variable  U  in  all  the  points 
belonged  to  the  next  plane  in  parallel  and  independent  way  can  be  calculated  at  every  step  (details 
see  in  the  article  by  Lamport). 

Note  that  this  extract  may  be  represented  in  the  following  form: 

+U(J-1,K,I)+U(J,K-1,I))*0.25  (3) 

K=2,...,N; 

Up  to  the  notations  this  variant  is  a  generic  mathematical  specification  where  iteration  index 
is  consider  in  a  natural  way.  Besides  the  specification  doesn’t  contain  reassignments  of  the  values  to 
the  variables  (reuse  of  memory).  So  in  extracting  parallel  actions  on  the  specification  (3)  it  isn’t 
necessary  to  take  into  consideration  neither  this  fact  (we  deal  with  the  language  with  single 
assignment)  nor  the  order  of  computing  loop  iteration.  Only  the  problem  of  analysis  of  index 
dependencies  is  left. 

As  in  such  a  description  the  mathematician  sees  the  geometrical  figure  -  parallelepiped  then 
he  mustn’t  forget  the  to  set  known  values  on  the  boundaries  (the  faces  of  parallelepiped)  required  for 
the  computation  or  he  will  take  care  of  their  computation.  Even  if  he  has  forgotten  about  them  then 
the  message  about  the  error  may  be  sent  in  the  essential  terms  with  the  reference  to  the 
corresponding  face. 

Now  let’s  consider  the  simplest  variant  of  the  task’s  solution  scheme: 

U(J,K,I)=(U{J+1,K,I-1)+U(J,K,I-1) 

+U{J-1,KJ-1)+U(J,K-1,I-1))*0.25  (4) 

J*2,...,M;  K®2,...,N;  1=1, ..vL. 

This  variant  is  obtained  in  natural  way  when  the  values  at  a  new  iteration  is  obtained  by  the 
values  at  the  previous  iteration.  This  variant  allows  wide  and  evident  parallelising  (all  the  points  at  a 
new  iteration  level  may  be  calculated  simultaneously). 
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The  second  variant  complicates  finding  the  parallelism  and  decreases  the  available  level  of 
parallelising.  This  variant  of  specification  may  be  explained  either  by  the  intention  of  algorithm 
optimization  by  means  of  saving  the  memory  or  by  the  wish  of  decreasing  the  number  of  iteration  or 
by  both.  This  approach  is  acceptable  for  sequential  realization.  But  loosing  the  capability  for 
parallelising  is  may  be  fatal  in  the  parallel  processing  environment.  Hence  the  idea  of  systematical 
transit  from  the  sequential  program  to  the  parallel  one  (e.g.  in  HPF  [5])  doesn’t  always  produce  the 
acceptable  results.  On  the  contrary  we  consider  systematical  transit  from  the  specification  allowing 
maximum  parallelising  to  the  variants  optimising  the  necessary  characteristics  by  decreasing  the  level 
of  parallelising  rather  promising  approach.  Note  that  algorithm  for  parallelising  for  form  (3)  may  give 
(in  the  case  of  memory  optimization)  the  same  level  of  parallelising  and  memory  capacity  as  the 
method  of  hyperplanes  by  Lamport  for  extract  (2). 

Nowadays  there  are  adherents  of  the  systematical  transit  from  the  sequential  programs  to 
parallel  ones.  It  refers  to  some  extant  to  the  supporters  of  HPF  approach  mentioned  above.  But  there 
are  some  opponents  of  such  a  method.  Let’s  cite  the  extract  from  the  article  by  P.H.  Welch,  G.R.R. 
Justo  [6]: 

•  design  standards  that  exclude  parallelism  also  exclude  security  for  complex  applications.  This  leads 
to  growing  losses  -  both  financial  and  human  life; 

•  efficient  and  robust  systems  cannot  be  built  by  “first  getting  them  to  work  serially  on  one  processor' 
and  then  “parallelising"  them  ; 

•  existing  “dusty-deck"  codes  that  represent  massive  financial  investments  that  “cannot  afford  to  be 
wasted”,  also  represent  massive  serial  codes  that  are  becoming  unmaintainable  and  are  certainly 
unverifiable.  These  are  technical  dead-ends  -  as  commercial  pressures  will  gradually  make  clear  to  all 
those  who  persist  with  them; 

•  tools  to  assist  the  parallelization  of  large-scale  serial  code  are  very  difficult  to  make,  will  be  very 
expensive  to  buy  and  will  not  be  needed  by  the  time  they  are  half-made  to  work. 

From  our  point  of  view  instead  of  transit  from  the  sequential  version  of  the  program  to  the 
parallel  one  or  direct  design  of  the  parallel  program  there  is  a  more  promising  third  way  when  the 
original  statement  of  a  problem  may  be  realized  both  in  parallel  and  sequential  variants.  This 
approach  is  based  on  the  main  principle:  when  formulating  a  new  task  don’t  impose  extra  constraints  - 
further  you  may  face  such  an  environment  where  they  cause  inconvenience  and  inefficiency. 
Fortunately  mathematical  specifications  are  kept  to  this  principle  nearly  always. 

1.4  Principal  notions  and  constructions  of  the  NORMA  language 

The  NORMA  language  was  originally  created  for  specifying  the  problems  of  mathematical  physics  by 
difference  methods.  A  particular  system  of  notations  is  formed  in  this  application  domain.  It  is  used  by 
technical  expert  in  writing  formulae  obtained  in  the  process  of  working  out  solution’s  method. 

This  system  of  notations  contains  both  common  mathematical  notions  (e.g.  integer,  real 
numbers,  vectors,  matrices,  functions  and  rows)  and  the  notions  typical  for  the  given  application 
domain  (e.g.  grid,  index  space,  grid  function,  iteration  on  time  and  space). 

Some  abilities  of  the  NORMA  language  are  demonstrated  below.  More  detailed  specification 

is  given  in  [7]. 

The  notion  domain  is  introduced  in  the  NORMA  language  for  representation  of  index  space. 
Domain  is  a  complex  of  integer  sets  w  >  0  each  of  them  sets  coordinates  of  the  point  from 

//-dimensional  index  space.  Unique  name  -  index  name  (name  of  index  space  s  coordinate  axe)  is 
linked  to  every  direction  (coordinate  axe)  of  //-dimensional  space.  Particular  case  of  the  notion 
domain  is  gnd  -  logical  rectangular  domain. 

One-dimensional  domain  is  used  for  setting  the  range  of  the  points  on  some  coordinate  axe  of 
index  space.  The  name  of  one-dimensional  domain,  the  name  of  index  and  the  range  of  changing 
index  values  are  indicated  in  the  simple  case  of  declaration  of  one-dimensional  domain: 

RegionK:  (kBl..15). 

The  boundaries  of  the  range  may  be  set  by  constant  expressions,  e.g. 

GridMN:  (i®M+3..2*N). 
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Multidimensional  domain  is  built  by  operation  of  domains’  product.  The  example  of  two- 
dimensional  domain’s  declaration  is  given  below.  Two-dimensional  domain  is  obtained  by  domains 

AxisK  and  AxisL  product: 

Square:  (AxisK:  (k=1..15)  ;  AxIsL:  (1*1  ..5)). 

Modi^cation  of  a  domain  includes  adding  some  points,  deleting  some  points  or  changing  the 

rsnQS 

Domain  may  be  conditional  and  unconditional.  Conditional  domain  consists  of  the  points  from 
index  space  which  number  and  coordinates  may  be  changed  depending  on  satisfying  or  failure  of 
satisfying  the  conditions  on  domain.  Unconditional  domain  consists  of  the  points  from  index  space 
which  number  and  coordinates  may  be  determined  at  the  translation  stage. 

It  should  be  noted  that  the  domain  determines  the  values  of  index  space  points’  coordinates 

but  not  the  values  of  calculated  variables  in  these  points.  .  ui  u  * 

Scalar  variables  (scalars)  are  used  in  NORMA  for  representation  of  simple  variables  but 
variables  defined  on  domain  are  used  for  representation  of  vectors,  arrays  and  matrixes.  Declaration 
of  the  variable  sets  its  type  -  REAL,  INTEGER,  or  DOUBLE  and  the  variable  on  domain  is 
connected  with  the  domain  indicated  in  the  declaration  (i.e.  the  values  of  this  variable  may  be 
computed  in  every  point  of  this  domain).  Declarations 

VARIABLE  First,  Last  DEFINED  ON  Square 


defines  variables  First,  Last  on  domain  Square;  it  means  that  the  values  may  be  assigned  to  these 

variables  in  every  point  of  domain  Square  for  A:  =  1  ,...15, /=  1  ,...5. 

Calculating  formulae  obtained  by  technical  expert  are  usually  written  in  the  form  of  relations. 
E.g.  calculating  formulae  for  the  solution  system  of  linear  equations  by  method  of  Gauss-Jordan  has 
the  form; 


= 

,,  y  =  1- 

..X 

/  =  1,... 

,N; 

= 

=  w,_ 

UJ  ^ 

7  =  1. 

...M, 

/  =  1, 

n.t 

= 

/  =  1.. 

">>4.4 

= 

7  =  1 . N, 

/  =  1, 

. /-1,/+1 

r  =  i 

r,,  = 


-V, 


/  =  i . f  = 


=  Tv.,. 


/  =  1 . N\ 


Extract  from  the  NORMA  program  is  given  below: 


Ot:(t=0..n) .  Oi:(i=1..n) .  Oj:0=l..n). 

Oij:(Oi;Oj) .  Otij:(Ot;Oii) . 

Oti:(Ot:OI) .  Otij1:Otij/t*l..n.  Otil:Oti/t=l..n. 

DOMAIN  PARAMETERS  n=10. 

VARIABLE  a  DEFINED  ON  Olj.  VARIABLE  m  DEFINED  ON  OtO- 
VARIABLE  b  DEFINED  ON  01.  VARIABLE  r  DEFINED  ON  Oti. 
INPUTaONOij,  b  ON  01. 

OUTPUT  X  ON  01. 

FOR  Otij/t=0  ASSUME  m  *  a. 
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FOR  Otl/t*0  ASSUME  r*b. 

OtiEQtjl ,  OtiNEtjl  /i=t.  OiEQtll ,  OINEtll  :Otn  /i=t. 

FOR  OtIEQtIjl  ASSUME  m  * 

FOR  OiEQtn  ASSUME  r  =  r[t-l  ,I=tl/m[M 

FOR  OtiNEtjl  ASSUME  m*m[t-ll-m[t-l,j=t]*mll=tl. 

FOR  OINEtll  ASSUME  r  *  r[t-ll-m[t-l,j*t]*rll*tl. 

FOR  01  ASSUME  x  *  r[t=n]. 

Necessary  computations  are  described  by  ASSUME  operator 

FOR  domain  ASSUME  relation. 

This  operator  is  a  key-notion  of  the  NORMA  ianguage.  The  relation  set  the  rule  of  computing 
the  variable’s  value  from  the  left  part  on  the  values  of  the  variable  from  the  right  part  and  index 
dependencies  between  the  variables.  The  variable  from  the  left  part  is  to  be  computed  in  all  the 
points  of  the  domain;  the  rule  for  this  computation  is  defined  definitely  but  immediate  fulfilment  of  the 
computation  isnl  required  in  the  given  place  of  the  program  and  the  method  and  the  order  of  the 
computation  isn’t  set.  Some  formal  similarity  with  assignment  operator  mustn’t  mislead  you. 

Indices  without  displacements  in  the  formulae  notations  may  be  omitted  because  they  are 
automatically  restored  by  the  translator  in  analysing  the  program. 

There  are  also  conditional  domains  used  in  the  considered  extract.  Thus  definition  OtiEQtiJ, 
OtiNEtijl:Otijl/l=t  determines  two  disjoint  subdomains  OtiEQtIJI  and  OtiNEtijI.  The  first 
subdomain  consists  of  the  points  from  modified  domain  Otijl  where  given  condition  i*t  is  true  and 
the  second  of  the  points  where  this  condition  is  false  (i.e.  fctf).  In  general  case  the  condition  in 
declaration  of  conditional  domains  is  represented  by  logical  expression. 

The  differences  in  the  given  above  two  ways  of  writing  calculating  formulae  are  in  the  form  of 
writing  (index  representation,  linearity  of  the  specification)  but  they  are  equivalent  in  their  contents. 

Special  instruction  ITERATION  is  used  in  the  NORMA  language  for  the  description  of 
iterative  processes. 

Declarations  of  input  or  output  variables  are  used  in  their  usual  way.  E.g.  declarations 

B/z<Eps. 

INPUT  Velocity  ON  A.  OUTPUT  Tau  ON  Bl. 

INPUT  X,  ALFA. 

are  the  requests  for  input  of  the  values  of  scalars  X,  ALFA,  variable  Velocity  in  all  the  points  of 
domain  A  and  also  for  output  of  the  value  of  variable  Tau  in  all  the  points  of  domain  B  where 
condition  Z<Eps  is  satisfied. 

The  order  of  input  or  output  of  the  variables’  values  isn’t  defined,  it  is  determined  in  the 
process  of  translation  of  the  program  automatically  (it  requires  special  processing  of  the  data  files  by 
the  translator). 

To  finish  the  introduction  of  some  NORMA  facilities  we  consider  reduction  functions.  They  are 
analogies  of  the  traditional  mathematical  notations  X  ’ll  ^’^®®® 

functions  has  the  form: 

mme-of-fimction  ((name-of-domain)  arithm-expression). 

Domain  determines  the  set  of  the  domain’s  points  where  the  function  is  to  be  computed, 
arithmetical  expression  is  a  set  of  the  values  which  the  function  is  to  be  applied  to.  Let’s  compute 
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The  description  of  this  computation  in  the  NORMA  language 

VARIABLE  A  DEFINED  ON  Grid  :  (Oi:(N1  ..N);Oj:(J*l  ..M)). 

VARIABLE  V,W  DEFINED  ON  Ol.  VARIABLE  X  DEFINED  ON  Oj. 

FOR  Ol  ASSUME  V  «  W+sum({Oj)  A*X). 

coincides  (up  to  the  notations)  with  the  original  calculating  formula.  Reduction  functions’  realization 
for  different  computational  environments  isn’t  an  easy  problem  but  this  problem  is  solved  by  the 
translator  not  the  user. 
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2  Semantics  of  the  NORMA  language 

Formal  semantics’  specification  of  the  NORMA  language  versions  1-22  [7]  are  given  in  this 
part.  Semantics’  specification  is  based  on  the  application  of  operational  approach  [8,9]  and  formal 
methods  of  relational  algebra  [10,11]. 

2.1.  Background  information 

2.1.1  Notations 

There  are  some  notations  used  below: 

R  -  set  of  real  numbers: 

N  -  set  of  natural  numbers; 

I  -  set  of  integer  numbers; 

T  -  set  of  program  P  identificators; 

F  -  set  of  program  P  arithmetical  expressions: 

L  -  set  of  program  P  logical  expressions ; 

0  -  empty  set; 

A  -  value  isn’t  defined; 

•  -  operation  of  strings’  concatenation; 

5,  X  -  relation  of  occurrence  of  substring  into  string  S2 : 

||y4j|  -  power  of  set  (or  relation)  A] 

~  -  “is  by  definition”: 

:=  -  “make  equal”; 

T,  F  -  logical  values  TRUE,  FALSE. 

The  construction  of  the  choice  has  the  form: 

(/’i  ->•  O'l'yPi  02,  -  P„  ->  0„^ 

where -  finite  number  of  statements,  any  objects.  It  determines  the  first 

(counting  from  the  left  to  the  right)  object  0,  which  has  /^  =  T .  Besides 

•  if  all  =  F,  /  =  !,..,»  then  the  construction  of  choice  is  an  empty  object; 

•  if  the  variables  Included  into  P^  are  used  in  setting  the  object  0,  then  the  values  assigned 
to  these  variables  have  =  T ; 

•  if  terms  of  type  are  included  into  P^  the  usage  of  the  variable  x  in  setting  object 

O,  means  that  all  the  values  may  be  assigned  to  this  variable  (in  the  arbitrary  order)  when 
F(x)=T; 

•  if  terms  of  type  3xF (x)  are  included  into  P^  the  usage  of  the  variable  x  in  setting  object 
O.  means  that  any  value  may  be  assigned  to  this  variable  (but  only  one)  when  F[x)  =  T  . 

2.1.2  The  elements  of  relational  algebra 

Give  some  definitions  according  to  e.g.[10]. 

D  -  some  (may  be  identical)  sets. 

R  is  said  to  be  a  relation  on  sets  ^^1,  if  c Z),0...0Z)jt  where  0 

operation  of  Cartesian  product. 

Sets  Z)„. ..,!>*  are  called  doma/ns  of  relation  i?. 

Elements  of  relation  R  which  have  eD,,  /  =  A  are  caiied  tuples  of 


relation  R. 
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Specification  of  type  R{A^,...,A^)  are  called  scheme  of  relation  R  and  4  's  an  attribute  of 
relation  R. 

Two  types  of  operations  are  defined  on  relations:  traditional  operations  on  sets  (union, 
intersection,  Cartesian  product  etc.)  and  special  relational  operations  (choice,  filtration,  projection, 
interconnection). 

Let’s  introduce  the  following  notations: 

Ry,R^,R  -  names  of  relations, 

A  -  attribute  of  relation  R, 

C  •  logical  condition, 
a,  3  -  tuples  of  relations. 

i?,  0  *  operator  of  relations'  Cartesian  product. 

Arguments:  relations  R^  and  R^ . 

Result  extended  Cartesian  product  of  relations  R^  and  .  If  i?,  =  then 

R^  0/?j  =a,  op„a,  op2,...,ai  op„....,a„  oPj,a„  °p2,...,a„  op^. 
i?,  ADD-TO  -  operator  of  adding  relation  R^to  R^. 

Arguments:  relations  i?,  and  R^  compatible  by  union  i.e.  having  equal  numbers  of  attributes  but  for 
i=1,...,n  the  value  of  one  and  the  same  domain  Z),  is  assigned  to  /-attribute  of  R^  and  R^ . 

Resut  relation  R^  added  with  the  tuples  of  R^  (without  repetitions). 

R1-R2  -  operator  of  7?,  and  R^  relations'  difference. 

Arguments:  relations  R^  and  compatible  by  union. 

Result:  relation  R  containing  the  tuples  of  relation/?,  which  aren’t  included  into  relation  7?^ . 

F(/?,0  -  operator  of  filtration. 

Arguments:  relation  R  and  logical  condition  C  built  from  constants  and  attributes  of  relation  R  by 
means  of  comparison  operations  =,9t,  >,<,>,<  and  logical  operations  v,a,-i  .. 

Result  relation  consists  of  those  tuples  of  relation  R  where  condition  C  is  true. 

R{A)  -  operator  of  Interception. 

Arguments:  relation  R  and  its  attribute  A. 

Result  relation  consists  of  the  values  of  attribute  A  only  in  relation  R  (without  repetitions). 

R(A)  •  operator  of  projection. 

Arguments:  relation  R  and  its  attribute  A. 

Result:  relation,  consists  from  the  tuples  of  relation  R  where  the  values  of  attribute  A  (without 
repetitions)  are  deleted  from, 
a  o  P  -  operator  of  tuples'  interconnection. 

Arguments:  a  =  (a^,...,a„)  and  P  =  {bj,...,b)^. 

Resu/f  tuple  a ° P  =  a, ,6, . 

/?,  o/?,  -  operator  of  relations’  interconnection. 

Arguments:  /?,  =  {a,, a, . aj  and  /?,  =  {p,,p,,...,pj  .  /?,  and  R,  -  relations  with  equal  power 

Result,  relation  R  =  {a,P],a,P2,...,a„P„} . 
f(R)  -  operator  of  relations’  transformation. 

Argument  /?=  {a,,a2,...,a„} 

Result  relation  R  =  {f(a,),f(ct2)>  ->f(a„)}  • 

UPDATE(a,y9,/?)  -  operator  of  tuple’s  renewal. 

Arguments:  relation  R,  tuple  a  =  (a„...,a„)  s/?,  tuple  p  =  (p„...,p„)  where  the  values  of  the 
same  domain  Z),  are  assigned  to  a,  and  P, . 
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Result  relation  R  where  tuple  a  is  substituted  by  tuple  P . 

Let’s  consider  arranged  relation  n(/),  I  defined. 

2.1-3*  Principal  notions  and  definitions. 

denoted  as  and  the  language  engendered  by  this  grammar  •  V 

will  be  typed  Ir  Italic  e.g.:  main-part,  declaration-of-domam. 

The  process  of  P  e  Z,(G^^)  program  compulation  Is  come  to: 

.  transformation  of  P  to  standard  form  P.  by  sequence  U„...,U,  of  preprocessor’s 

operators  design  absfracf  program  A(P)  on  P,.  tbe  abstract  program  is  represented  In  tbe  form  of 
rpiations  Domains  Variables,  Input,  Output,  InputData,  Relations-, 

relat.onsD«  of  abstract  program  ^(P)  by  abstract  AM  machine  ft  .s  come  to  camy.ng 

out  seouence  0  0,  operators  of  the  relational  algebra  and  modifying  the  relations  given  above. 

reZoess  ofimemriatlon  consists  of  tbe  sequence  of  srages.  The  denmtlon  of  Interpretatron  s 

staoe  is  oiven  below  in  the  description  of  the  rules  of  AM  machine  work. 

^  Thus  the  semantic  Sem{P)  of  the  program  P  is  determined  by  the  following  process. 

&m(P);  P,:=Pre(P).  4(F):=  7>(P,),  AM(A(P)). 
where  Pre  Is  a  preprocessor  transfoiming  P  to  equivalent  standard  loim  P/,  ^ 

Tr  IS  a  translator  prepadng  abstract  program  A(P)  for  AM  machine  on  standard 
rscresentaticn  P, .  Functions  of  preprocessor  Pre  and  translator  Tr  are  defined  in  the  next  cafe 

Operatorsf/,,  J  =  and  Q,  /  =  may  be  cond/frona/  and  unconditional.  The 

transformation  of  pr^ram  P  is  the  result  of  carrying  out  unconditional  operator  £/,.  Changing  of 
relations'  tuples  Is  the  result  of  carrying  out  conditional  operator  O,.  Value  T  or  F  is  the  result  of 

carrying  out  Uj  or  0, . 

Program  P  contains  semantic  error  if 
3j:  =Fv3/;  G=F 

Pragram  P  is  semantically  conect  If  there  is  no  semantic  errors  in  it.  v-  P  if  the 

Variable  .V,  of  program  P  has  infoimatlon  dependence  on  variable  .T.  of  program  P  rf 

value  of  X,  or  the  value  of  X,  which  has  Information  dependence  on  X,  is  necessary  for  the 

computation  of  X,  value.  Variable  X,  has  fnfbmrafion  Independence  if  there  rs  no  variable  X,  o 

program  P  which  variable  .V,  depended  on.  The  value  of  X,  is  rfirecffy  oompufabfe  rf  n  has 

information  independence  or  the  values  of  all  the  variables  which  X,  ,s  depended  on  have 

been  already  comped,  mac^i^^  Is  a  non-determined  device:  semantics  SeniiP)  maxes 

rpe7a=:raraS^^^^^^^ 

oependencles  between  the  variables  of 

program  P  Is  Mid  to  be  natural  (ideal)  parallelism  of  program  P.  Informally  natural 

mat  at  everv  moment  the  values  of  all  the  directly  computable  vanables  of  program  P  are  being 

rmputrProm  drls  point  of  view  semantics  &m(P)  may  be  considered  as  me  semarrfrcs  of 

natural  parallelism  as  in  padlcular  AM  realises  the  natural  parallelism  of  program  P. 
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Let's  define  the  way  of  representing  syntactically  correct  program  P  (syntactically  incorrect 
programs  aren’t  considered  as  they  don’t  belong  to 

Define  set  E  of  elementary  objects,  set  H  of  compound  objects  and  set  O  of  language 
)  objects: 

E  =  {identifier,  key-word,  constant,  sign-of-operation,  delimiter}', 

H  =  {program,  part,  declaration,  operator,  ....  declaration-of-domain,  declaration-of- 
input,...,  arithm-expression, . . .} 

0=EU  H 

Informally  set  H  consists  of  the  language  constructions  engendered  by  all  the  non-terminals  of 
grammar  Gyo„„^  except  non-terminals  included  into  E  and  principal-symbols. 

Consider  system  (0,S),  S  is  a  set  of  selectors.  _ 

Let  is -a  be  a  predicate,  defined  on  O,  is-a:0-^(T,F}.  Selector  ot  is -a  is  said  to  be 

subdomain  O  satisfied  predicate  is-a . 

E.g. 

is  -  identifier  =  (xj  is  -  identifier{x)} 

For  the  case  when  selector  is-a  contains  the  only  element  a  let’s  use  notation  a  (where  it  doesn’t 
cause  ambiguity).  E.g.  terminals  is-{  and  /5- BEGIN  will  be  represented  as  (  and  BEGIN 
correspondingly. 

Program  P  is  corresponded  to  the  system  (Op,Sp),  0^  Cl  O,  C  S  and  it  is 
represented  in  the  form 

/>  =  o,  *02  //>l,  o.eOp  (5) 

and  the  application  of  selector  s  e  to  a  program  (or  its  part)  in  the  form  (2)  is  represented  in  the 

form  s{P)  (correspondingly  5(0),  o  e  Op ). 

Let’s  introduce  some  auxiliary  operators  and  functions. 
positionip.,o)  -  function  which  determines  the  number  of  subobject  o^  in  object  o  for  representation 
of  (5)  type 

position{o.,o)  =/ 

Object  o,  is  said  to  be  to  the  left  of  object  o,  in  object  o  (notation  o^  o, )  if 
position{Oj,o)  < position{Oj,o) . 

Replace{o^,o^,o)  -  operator  of  substitution  instead  of  o,  in  o  ■,Oy,o^,o  e  Op. 

Replace(o, ,02,o)~(o  =  A  *0,  •  B-^  A  *02  •  B) 

Replace All{o^.o,,o)  -  operator  of  substitution  o,  instead  of  all  the  occurrences  of  o,  into  o; 
o,,o,,o  e  O^. 

ReplaceAll(o^ , o, , o)  ~  ( Vo, (o,  x o)  ->  Rep/ace{o^ ,02,0)) 

Delete(p^,o)  -  operator  of  deleting  the  occurrence  of  o,  into  o  ;  o,,o  s  . 

Deleteip.^  ,o)~-(p=  A*  o^•  B  A*  B) 

DeleteAll(p2,o)  -  operator  of  deleting  all  the  occurrences  of  o,  into  o ;  o,,o  €  Op . 

DeleteAll{Oj,o)  ~  (Vo,(Oj  x  o)  ->  Deleteip^,o)) 

Include(p2,o)  -  operator  of  including  o,  into  o,  keeping  syntactical  correctness  of  object  o; 

0,,0€  Op. 

tiname(P)  -  generation  of  the  name  unique  in  program  P. 
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uname{P)  -o,  o^iTs-  identifieriO)  a  -Bip{p  e  Op/\o  =  p). 

2.2  Relations  of  abstract  program  ^(P). 

Domains  {Cond,  D,  Iname,  1):  domains  of  program  A{P). 

Cond  6  L  -  logical  expression. 

Z)  e  T  -  identifier  of  domain. 

Iname  =  /,<>... <>/„,  /,  e  T  -  identifier  of  domain’s  indices,  n  -  dimension  of  domain  D. 

/  e  Z),  ®. .  ,  Z),  €  N  -  index  of  domain  D.  The  values  of  D  domain  (sets  of  integer  numbers) 

are  values  I. 

Tuple  (^Cond  o  D°  Iname  ol)=  {Condo  Do  belongs  to  relation 

Domains,  if  point of  index  space  Z,o...oZ„  belongs  to  domain  Z). 

Variables  (D,  X,  Iter)-,  definition  domains  of  program  A(P)  variables. 

Z)  e  T  u  0  -  identifier  of  domain. 

X  €  T  -  identifier  of  variable. 

Iter  €  T  -  identifier  of  iteration  index. 

Tuple(Z)o  XoA)  belongs  to  relation  Variables  if  variable  X  is  defined  on  domain  D.  If 
variable  X  is  indicated  in  the  list  of  iterated  variables  with  iteration  index  Iter  then  tuple  of  relation 
Variables  has  the  form  {Do  Xo Iter) . 

Input  (Var,  Value,  File),  input  variables  of  program  A(P). 

Var  =  X{I),  X  -  identifier  of  an  input  variable,  I  e  D^®...®D„,  Z),.  €  N  -  index  of 

input  domain  D  (set  of  integer  numbers). 

Value  e  RU  A  -  values  of  input  variable  X with  indices  Z. 

File  -  name  of  an  input  file. 

Output  {Cond,  Var,  Value,  File):  output  variables  of  program  A(P). 

Cond  e  L  -  logical  expression. 

Var  =  X{I),  JSr  e  T  -  identifier  of  an  output  variable,  Z  €  Z>,®...®Z)„,  Z),  e  N  -  index 

of  output  domain  D  (set  of  integer  numbers). 

Value  e  RU  A  -  values  of  output  variable  with  indices  I. 

File  -  name  of  an  output  file. 

InputDala  (Var,  Value,  File),  the  values  of  input  variables  of  program  A{P),  stored  in  the  external 
flies. 

Attributes  are  coincided  with  the  attributes  of  relation  Input  except  the  set  of  values  of 
attribute  Value:  Value  e  R . 

Relations  (Name,  ItVal,  Def,  Cond,  Var,  Value,  Func):  computed  variables  of  program  .4(Z*). 
Name  e  T  -  identifier  otA{P)  program’s  part. 

ItVal  €.  [Itid  •  C}  ^  {0  0}  -  character  of  computation  in  construction  iteration 

{ItVal=  {ItId»C])  or  outside  iteration  ItVal  =  (0  0};  Itid  s  T  -  identifier  of  iteration  index. 
C  €  I  -  the  value  of  iteration  index. 

Def  €  L  -  logical  expression. 

Cond  e  L  -  logical  expression. 

Var  =  Xff)...X^{f),  k>\-  variables  computed  in  part  Name.  X,  e  T  i  =  \,...,k  - 
identificatore  of  variables:  Z,  e  D,®...®D„,  i  =  1,...,^,  A  «  N  -  the  values  of  computation’s 
domain  D^®..®D„  (sets  of  integer  numbers). 
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Value  =  Value^,..., Value Value ^  €  RUA  i  =  values  of  computed  variables 

Var  =  X,(I,)...X,(I,),  k>l. 

Func  €  F  -  type  of  functional  dependence  for  variables  Var. 

The  tuples  in  every  relation  defined  above  are  considered  arranged  in  some  fixed  way. 

2.3.  Environment  of  abstract  AM  machine. 

Environment  of  abstract  .(4A/  machine 

Env  =  {Domains,  Variables,  Input,  Output,  Relations,  InputData,  IndSpace,  Files) 

IndSpace  =  {IndSpace,},  i  =  \,...,k,  A  -  number  of  program parts; 

IndSpace,  =  name-part^  •  oj  •  Oj  .  .•  o'„,  n>  0, 
o’j  e  is  -  name-index(  part, ). 

Files  -  the  names  of  external  data  files. 

Files  =  {filename^ filename, } ,  / ^  0. 

2.4  Definition  of  preprocessor  Pre(P). 

Preprocessor  Pre  translates  the  text  of  some  program  P  into  equivalent  standard 
representation  P^  or  fixes  semantic  errors. 

Pre{P)~  MA1N-EXIST(P); 

(  Vo(o  e  75  -  part{P))  -> 

LOCAL-VARIABLES-RENAME(o); 

NONREC-DOMAlN-DECLARATION(o); 

EXIST-UNIQ-DECLARATlON(o); 

MACRO-INI>EX-EXCLUDE(o); 

DEF-INDEX-SPACE(o); 

PAR-DOMAIN‘EXCLUDE(o); 

CONST-EXPR-^ALUATE(o); 

STANDARD-DOMAIN(o); 

STAND  ARD-ASSUME(o)) 

2.5  Specification  of  preprocessor’s  operators. 

0  Checking  main  part  existence  MA1N-EXIST(P). 

Main  part  must  be  in  the  program  and  be  the  only  one.  _ 

MAIN-EXIST(/’)  ~  a!o(o  €  is -main- part {P)) 

~  Renaming  local  variables  LOCAL~VARIABLES“RENAME(o). 

Names  localized  in  the  parts  are  substituted  to  the  unique  ones.  The  names  of  parts  and 
functions  aren’t  renamed.  Localized  in  the  parts  names  of  input  variables  (scalare  and  variables 
defined  on  domain)  are  renamed  according  to  the  names  of  input  variables  set  in  the  input  files. 

LOCAL-V'ARIABLES-RENAME(o)  ~ 

(V/7(/7  6  is  -  name-main-  part  {o'))  ->•; 

V/7(/7e  is -name-simple  -  partjp)) 


Mpip  e  is  -  name-  fimction(o)) 
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yp(j?e  is ‘name-external  -  function{p)) 
yp(/7€  is -name-external -simple  -  part  jo))  ->■; 
yp(/7e  /5  -  name-reduction  -  functionjp)) 

V/?(/7  €  is -name-standard  -  functionjp)) 

V/7(/?  €  w  -  -  scalar  {o) 

A3«(Me  /.y  -  name-scalari  g))  a  3 v(v  g  is-name-file{p))) 
L0CAL-1N-NAMES(m,  V,  o,  F/7ej); 

€  /y  -  input  -  scalar  (o) 
a3m(m  g  15  -  name-scalari  p))) 

->  LOCAL-IN-NAMES(m,  ’norma.dat’.  o,  Files); 
yp(p  €  /5  -  w/w/  -  ow  -  domain{d) 

A\fu(ii  e  is-inptit-on-domain{p))A3v{v  e  /y  -  name-file{p))) 

->  L0CAL-1N-NAMES(m,  V,  o,  F/7ey); 
yp(/7€  is-input -on-domain(p) 

aV//(z/ G  is-inpnt-on-domain{p))) 

->  LOCAL-IN-NAMES(m,  ’norma.daf.  o,  Fr7ey); 
yp(/>  G  Ts- identifier  ip))  Replace  All  {p,uname{P),o)) 

Auxiliary  function  LOCAL-iN-NAMES(//,  v,  o,  F//<?y)  carries  out  agreed  renaming  of  the 
names  of  input  variables  u  localized  in  part  o  and  input  from  file  v,  and  the  names  of  input  variables 
set  in  input  files  Files. 

LOCAL-IN-NAME5(m.  v,  o.  Files)  ~ 

(  3p{p  G  Files  A  V  =  />  A  G  is-  name-scalari  p)  Aq  =  u)) 

t  :=  \mMAE.iP);ReplaceAlliq,  t,  p);  ReplaceAlliii,  t,  o); 

3p{p  G  Files  Av  =  p  A  3q3r{q  g  is  -  name-variable  -  on  -  domainj  p)  Aq-u 
Ar  G  is  -  list  -  range  -of  -  index  Aq*(*r  •))) 

->  RENAME-IND(r,  p,  o)) 

Auxiliary  function  RENAME-IND(r,  p.  o)  carries  out  agreed  renaming  of  input  variable's 
indices  r  localized  in  the  part  o  and  input  from  file p. 

RENAM£-IJMD(r,/?,  o) 

(Vw(m'G  is-name-indexjr) 

f  :=  UNAME(F);  ReplaceAlliw,  t,  p);  ReplaceAlliw,  t,  o)) 

0  Checking  non-recurrence  of  domains' declarations  NONREC-DOMAIN-DECLARATlON(o). 

Domain  with  name  o,  is  said  to  be  depended  on  domain  with  name  Oj  in  part  o  if  either  the 
name  of  domain  Oj  or  the  name  of  domain  O3  depended  on  is  included  textually  into  declaration- 
of-domain  o, .  Let  this  character  be  checked  by  function  DEPENDENCE-DOMAIN  (o, ,  o, ,  O3 ) . 
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DEPENDENCE-DOMAIN (o,, 02,03)  ~ 

3/;3^(  p  €  Ts- declaration -of  -domain{o) 

a(  q  ^  is-domain- product(p) Ap-o^•:•{•q•) 

a(o2  ccqv (O3  ozq  A  DEPENDENCE-DOMAIN (O3 ,0^,0) )) 
v^e  is-new-domam(p)Ap  =  o^*:»q 

a(p2  ocqvip^  ozq  A  DEPENDENCE-DOMAIN  (03,03 ,0) )) 

VO,  €  is -name -of  -conditional -domainip) 
a(o2  e  is -name -of  -domain{p) 

v(o3  e  is -name -of  -c/oma;/7(/>)  a  DEPENDENCE-DOMAIN  (03,03,  o) )))) 

There  must  be  no  interdependencies  of  the  domains  (recursive  declarations  of  domain). 

NONREC-DOMAIN-DECLARATION(o)  ~ 

Vo,  V02 (o,  ,03  e  is-  name  -  of  -domainjo)  a  o,  9^  Oj 

=  (DEPENDENCE-DOMAIN  (o,  ,03 ,0)  A  DEPENDENCE- 

DOMAIN  (03,0,,  o)  )) 

0  Checking  declaration’s  being  and  being  unique  EXIST-UNIQ-DECLARATlON(o). 

In  each  part  any  name  (except  name-of  index  iteration)  must  be  declared. 

£XlST-UNIQ-D£CLARAT10N(o)  ~ 

V/?(  pe  fs-  name  -  of  domainjo)  ^3\q{q<^Ts-  declaration  -of  -  domainip) 

A  NAME-DOMAIN-DECLARE(p.^)) 
v/7  e  is -name -of  scalar  jo)  s 

3!^3!r(q' €  is -name -of  scalar  jo) 

Ar  els-  declaration  -of  -  scalar  -  variablesio)  Aq  xr  Ap  =  q) 
s/p  e  is-  name  -  of  -  variable  -  on  -  domainjo)  = 

3 !  4/3 ! rjq  e  is  -  name  -  of  -  variable  -  on  -  domainjo) 

AT  els-  declaration  -of  -  variables  -  on  -  domains  jo)  Aq  <x:r  Ap  =  q) 
s/p  e  is-  name  -  of  -  index  -  constructionjo)  = 

31731  rjq  €  fs  -  name  -  of  -index  -  constructionjo) 

Ar  e  Is-  declaration  -of  -  index  -  constructionjo)  Aqccr  a  p  =  q) 
s/p  e  is-  name  -  of  -  domain' s  -  parametersjo)  = 

3!o31r(o  g  is -name -of  -domain's- parametersjo) 

Ar  e  is  -  declaration  -of  -  domain' s  -  parametersjo)  Aq<x.r  a  p  =  q) 
s/p  e  is-  name  -  of  -  external  -  function  jo)  = 

3\ql\rjqe  Is  -  name  -of  -  external  -  function  jo) 

Ar  els-  declaration  -of  -  external  -  Junctions  jo)  Aq<x:r  Ap  =  q) 
s/p  e  is  -  name  -  of  -  external  -  simple  -  part  jo)  s 
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3!^3!r(^  €  Is  -  name  -of  -  external  -  parts{o) 

Ar  G  is-  declaration  -of  -  external  -  simple  -  parts(o)  /\qccr/\p-q) 

V  p  €  is-  name  -of  -  indexjp)  = 

3^3r  €  S  -  name  -  of  -indexjp) 

Ar  G  is-  one  -  dimensional  -  domainjo)  Aqxr  a  p  =  9)) 

Function  checking  name-of-domain  declaring  in  declaration-of-domain  q 

NAME-DOMAlN-DECLARE(p,g^)  ~ 

35(r  G  is-s(q)Aq  =  p*:»r) 

v35,3j2(r  G  is-sfq)  au  €  is  -Sjjq)  A(q  =  p  •  ,  •  u  • :  •  r  v  q  =  u  •  ^  •  p  • :  •  r)) 

□  Eliminating  macroindices  MACRO“INDEX-EXCLUDE(o). 

Any  name-of-index-construction  included  into  index  expressions  is  substituted  to 
corresponding,  list-explicit-id-expression  which  is  declared  in  declaration-of-index-construction. 
Then  declaration-of  index-construction  is  eliminated  from  the  text  of  the  program. 

MACRO-lNDEX-EXCLUDE(o)  ~  ( Vp(p  G  is  -  declaration  -of-  index  -  constructionjo) 

Aq  e  is-  name  -of  -  index  -  constructionjp) 

Ar  E  is-  list  -  explicit  -  ind  -  expressionjp)) 

->  ReploceAll{q,r,o),Delete{p,o)) 

□  Constructing  index  space  DEF-INDEX-SPACE(o). 

The  order  of  directions  (of  axes)  of  index  space  coincides  with  the  arrangement  of  the  index 
names  in  declaration-of-domains'-indices  (from  the  left  to  the  right).  If  there  is  no  declaration-of- 
domains  ’-indices  then  the  order  of  the  directions  is  defined  by  the  arrangement  of  names-ofz 
indices  in  the  text  of  the  program. 

DEF-IND-SPACE(o)  ~ 

T  :=  6ET-PART-NAME(o); 

(ypjp  G  Ts- declaration -of  -  domains' -indicesjo)  a  q  eis-Wst-riame-of  -indexjp)) 
ReplaceAllj^  q)',  AddListIndexjq,  T);  Delete jp,  o); 

V  qjq  G  is  -  name  -of  -  index  jo)) 

->  Addlndexjq,!)): 

IndSpace :  =  IndSpace  { r} ; 

Function  S£T-*?ARTNAME(o)  give  the  name  of  part  o 
6LET-PARTNAME(o)  ~ 

jis  -  main  -  part  jo)  a  3  njis  -  itame  -  of  -  main  -  partjn)  a  MAIN  P  ART  •n'xo) 

->  n; 

jis -simple  -  part  jo)  a  3«(/5  -  name  -  of  -  simple  -  part  jn)  a  PART  •nxo) 

-^n; 

jis  -  part  -  fiinctionjo)  a  3  njis  -  name  -  of  -  part  -  function  jn)  a  FUNCTION  •nxo) 

->  n  \ 
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OpQxaW  AddListIndexiq.T)  puts  list  of  indices  q  into  index  space  J^and  if  relation  q<„r 
is  true  then  ^-<7-  r  where  r  g  is  -  list  -  name  -of  -  index{o) . 

OperaAor  AddIndex{q,T)  puts  name  of  index  ^  into  index  space  T  (if  it  hasn’t  been  there  yet) 
and  if  reiation  q<„r  is  true  then  q  <r^  where  r  G  is  -  ncane  -  of  -  indexjp)  foro. 

□  Substituting  domains’  parameters  PAR'DOMAIN-EXCLUDE(o). 

The  values  of  domains  ’  parameters  are  substituted  instead  of  the  names  of  domains  ’ 

parameters  in  the  text  of  the  part. 

FAR-DOMAlN-EXCLUDE(o)  ~ 

(Vp(/7G  Ts- declaration -of  -  domains'  -parametersip) 

Aq  e  is-  name  -of  -  domain' s  -  parameterfp) 

Ar  e  is-  integer  -  without  -  sign(p)  Aq*  =  »rccp) 
ReplaceAll{q,r,o);Delete{p,o)) 

□  Calculating  constant  expressions  CONST-EXPR-RV ALUATE(o). 

The  values  of  all  the  constant  expressions  are  computed. 

CONST-EXPR-EVALUATE(o)~  (Vp(/?  G  is- constant -expressionip)) 

EVALUATE-EXPR  ip) 

^p{p  €  is  -  constaji!  -  expression  -  withnv.l  -  signin')) 

->  EVALUATE-EXPR  (p); ) 

Function  EVALUATE-EXPR  (e)  computes  the  value  of  expression.  Semantics  of 
computing  the  expression  has  traditional  meaning  and  may  be  defined  by  a  well-known  algorithm  of 
translating  arithmetical  expression  into  Polish  notation  [12]  and  the  rules  of  this  notation’s 
interpretation. 

~  Making  domains  standard  ST AND ARD-DOMAIH(o). 

The  main  purpose  of  domains’  standardisation  is  to  reduce  all  the  declarations  of  rectangular 

domains  to  the  form 
declaration-of-rectangidar-domain: 

multidimetjsional-domain 

multidimensional-domain: 

one-dimensional-domain 

name-of-one-dimensional-domainl  {domain-product^ 
domain-product: 

component-domain  {  ^  component-domain  ) 
component-iiomain: 

one-dimensional-domain 

{domain-product) 

one-dimensional-domain: 

ttame-of-one-dimensional-domain:  {name-of-index=value) 


STANDARD-DOMAlN(o)  ~  IMPUCrT-NAMES(o); 

DEF-DOMAIN-EXCLUDE(o); 
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NAME-DOMAIN-REPLACE(o); 

RECT-N-DOMAIN-TRANSFORM(o) 

Function  IMFLICIT-NAMES(o)  carries  out  the  following  transformation;  unnamed  new- 

domaim-without-name  defined  in  operators  ASSUME,  INPUT,  OUTPUT  are  substituted  in  these 
operators  to  the  unique  names,  declarations  of  these  domains  are  included  into  the  program  under 
corresponding  names. 

IMFUCrr-NAMES^o)  ~ 

{yp({p  e  is- operator  -  ASSUME(o) 

Vp  €  is-  declaration  -of  -  input{o) 

V/?  e  is-  declaration  -of  -  output{o)) 

/<^q{q  G  is-  new  -  domain  -  without  -  name{p))) 

->■  name  ;=  uname{P); Replace{q,  name, p); Include(name  q,  o)) 

Function  l>EF-DOMAIN-EXCLUDE(o)  carries  out  the  foliowing  transformation:  declaration 
def  of  every  multidimensional-domain  with  name  name-of-multidimensional-domain  and  every 
one-dimensional  domain  with  name  name-of-one-dimensional-domain  used  as  the  component  of 
deciaration  def^  of  another  multidimensional-domain  are  included  as  the  addition  into  the  text  of 
part  o.  Name-of-multidimensional-domain:  or  name-of-one-dimensional-domain  included  into 
def  is  eliminated  from  def, . 

DEF-DOMAIN-EXCLUDE  (o)  ~ 

(Vp(  pe  is  -  declaration  -of  -  rectangular  -  domain{p) 

A{q  e  is  -  name  -of  -  multidimensional  -  domain(j>) 

vq  ^  is  -  ncm7e  -of-  one  -  dimensional  -  domain(p)) 
rBs^Bs^Ssfr^Q  is-sfp)Ar2e  is-S2ip)Ar^G  is-sfp) 

A/7  =  • :(  •  Tj  •  )  •  Tj)) 

-»■  Replace{q  • :(  •  z":  •  ),  (  •  /2  •  ),p);Inclitde{q  •'.{•r,  »  ),o)) 

Function  NAME-DOMAIN-REFLACE(o)  carries  out  the  following  transformation:  every 
name-of-multidimensional-domain  and  every  name  of  one-dimensional-domain  used  as  the 
component  of  declaration  def  of  another  multidimensionai-domain  is  substituted  to  corresponding  to 

these  names  declaration  of  multidimensional-domain  or  one-dimensionai-domain  in  declaration  def, . 


NA^flE-DOMAIN-SEPLACE(o)  ~ 

( 'ip'^qi  p  €  is-  declaration  -of  -  rectangular  -  domain{o) 

A{q  e  is- name  -  of  -mtdtidimensional-domain{p) 

vq  e  is  -  name  -of  -  one  -  dimensional  -  domain(p)) 
A(q»;^pvq*)a:p) 

a3  u  (NAME-DOMAIN-DECLARE(^,m)  AU  =  q»:»v)) 

->  Replace{q,v,p)) 
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Function  RECT-N-DOMAIN-TRAIiSFORM(o)  transforms  the  declarations  of  rectangular 
neM!  domains  into  equivalent  declarations  of  rectangular  multidimensional  domains. 

RECT-N-DOMAIN-TRANSFORM(o)  ~ 

(V/7(  pe  is -declaration -of  -  rectangular -domainip) 

Ais-  new  -  domain{p) 

A  3  m3  v(m  &  is-  name  -  of  -  rectangular  -  domain(p) 

A  V  e  is-\\st-modification(p)  Ap  =  q»:*u»l  •v) 

->  {is  -  name -of  -  one  -  dimensional  -  domain{u)  a  3  w  (NAME-DOMAIN-DECLARE(m,  w)) 

Replace{  q  • :  •  u  •  I  •  v,  q  •  :  •  MODIFV(w,v,o),  p)', 
is  -  name-  of  -  multi  -  dimensional  -  domain{u)  a  3w  (NAME“DOMAIN“DECLARE(m, 
w)) 

3s(s  e  is  -  name-new  -  domain(w)) 

->  NAME-DOMAIN-REPLACE(o); 

is  -  name-  of  -  multi  -  dimensional  -  domain{u)  a  3  w  (NAME-DOMAIN-DECLARE(m, 
w)) 

-> Replace{ q*\*u*  I •  v,q*’.*  MODIfY(H',v,o), /?))); 
NAME-DOMAIN-REPLACE(o) 

Function  MODIFV(w,v,o)  chooses  the  next  element  from  the  list  of  modifications  v. 
MODIFY(w,v,o)  ~ 

( is-modification{y)  MDFONE(>v,v,o); 
is  -  Wst -modification{v)  MDFONE(H',/feflK/(v),o);  MODlFV'(w,/a/7(v),o)) 

Functions  MDFONE(w,v,o)  and  CORRECT(«,g)  correct  the  domain  according  to 
modification  v. 

MDFONE(w,v,o)  ~  _ 

{3m3n{me.  is-name-index(v)  An  g  is-value{v)  AV-m  •  —  •  n 

A3a(a  e  is -  value{w)  Aw-m  •  =  •  a) 

->  Rcplacc{m  •  =  •  a,  m  •  =  •  n,  ir); 

3m3g{m  e  is  -  name-of  -  one  -  dimensional  -  domain{v)  Ag^0Av  =  m*g 

A  3c3c/(c  s  is  -  name-  index(o)  a  t/  €  is  -  valiie{o)  a  m  •  :{  •  c  •  =  •  d  ) 

->  MDFONE(u',  c  •  =  •  CORRECT(c/,^f))) 

CORRECT(w^)~ 

( 3a3b{is  -  int-  constant(a)  Ais-  int-  constant(6)  a  /?  =  a  • . .  •  6) 

->  (3c(/5  -  int-  constant(c)  Ag  =  +LEFT(  •  C  •  )  ->^  evaluate  -  expr(a  -c)*  b) 
3c{is-  int-  constant(c)  a  g  =  -LEFT(  •€•)->  evaluate  -  expr(a  +  c)  •  •  6) 

3c(is- int-  constant(c)  Ag=  -RIGHT(  •  C  •  )  ->  a  • •  evaluate  -  expr(Z»  -  c)) 
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3c(/5-  int-  constant(c)  Ag=  +RIGHT(  evaluate  -  expr(^  +  c))) ) 

□  Making  ASSUME  operators  standard  standard-ASSUME(o). 

Every  operator  ASSUME  consists  of  several  relations  is  transformed  into  equivalent 
sequence  of  operators  where  each  operator  ASSUME  consists  of  a  relation. 

standard-ASSUME(o)  ~  _ 

(V/7(/7€  is  -  operator -ASS\itAE{o)/\q  e  is  -  name-domaini p) 

aFOR  •q*  ASSUME  Qc p) 

ReplaceAll{y,.  FOR  •  ^  •  ASSUME,/?) ) 
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2.6  Definition  of  translator  TriP,) 

Translator  Tr  carries  out  preparation  of  abstract  program  A(P)  for  AM  machine  on  standard 
representation  P,  or  fixes  semantic  errors. 

Tr(Ps)  ~  (Vo(o  €  part{P)) 

RECT-M-DOMAIN-E\^ALUATE(o); 
D1AS-D0MAIN-E^^ALUATE  (o); 
VAR-DECLARATION(o); 
REBUILD-INDEX(o); 
COND-DOMAIN-EVALUATE(o); 
INPirr-DECLARATlON(o); 
OUTPUT-DECLARATlON(o)); 
INPUT-DATA-PROCESSINQ(F/7e5); 
CHECK-INPUT-DATA(P); 
CALL-€RAPH-CREATC(P); 
CHECK-CALL*SRAPH(P); 

(Vo(o  e  is-pca-t{P))  -> 

OPERATORS-TO-RELATIONS  (o)) 


2.7  Specification  of  translator’s  operators 

~  Computing  rectangular  multidimensional  domains  RECT”M-DOMAIN-EVALUATE(o) 

Declarations  of  rectangular  multidimensional  domains  are  transformed  into  the  tuples  of 
relation  Domains.  The  order  of  indices  in  domains’  declaration  is  determined  by  the  indexes’  order 
set  in  Indspace.  Coordinates  of  the  points  from  index  space  making  up  rectangular  multidimensional 
domain  are  calculated  statically. 

Function  RECT-M-DOMAIN-EVALUATE  uses  supplementary  structures: 
DomamNotOrdered{q)  -  ^  »  (m,  ,  v, )  «>  .•  •  • »  (h„  ,  v„  ) ,  where  (6) 

q  -  name-of-domain. 

,  V;  -  name-of-index  and  its  value, 
n  -  dimension  of  domain; 

DomainOrdered{q)  -  qo  Iname  o  v,  o  • . .  o  v„ ,  where  (7) 

q  -  name-of-domain. 

-  name-of-index  and  its  value,  Iname  *  where  if  Vi  <i„dspace(o)' 

^ DomanOrderediq)  ’ 

n  -  dimension  of  domain. 

RECT-M-DOMAIN-E\^ALUATE(o)  ~ 

i^P(P  e  is  -  declaration  -of  -  rectangular  -  domain(o) 

A  is-mtdtidimensionaUdomain(p) 

Aq  €  is  -  name  -  of  -  domain(p)) 

->  DomainNotOrderediq)  ;=  q;  _ 

(Vi/Vv(m  €  is  -name- of  -index(p)  a  v  e  is-  value{p)  Au»  =  *v<xp) 
DomainNoiOrdered{q)  .=  DomainfloiOderediq)  °  (h,v)); 
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ORDER-INDEX  (PomainNotOrdered{q)JndSpace{o)J)omainOrdered{q)), 
MAKE-DOMAIN  WomainOrdered(q)Ji); 

T®R  Domains; 

Delete(p,o)) 

Function  ORDER-INDEX  {DomamNotOdered{q),  IndSpace{o),  DomainOderediq)) 
transforms  representation  (6)  into  (7). 

Function  MAKE-DOMAIN  {DomamOdered{q)Ji)  forms  auxiliary  ‘basic’  relation  R  for 
domain  q  on  representation  (7).  This  relation  differs  from  Domains  by  absence  of  attribute  Cond. 

MAKE-DOMAIN  (DomainOdered{q)Ji)  ~ 

R  :  =  q®  Iname{q)  0  6ENEBfiE(v, )  0  •  •  •  0  €Er®8SE(v^ ) 

6ENERATE(m  •  • .  •  v)  ~  F(n(/),  u<I<v) 

□  Computing  diagonal  domains  DIAQ-DOMAIN-EV^ALUATE(o). 

Declarations  of  diagonal  domains  are  transformed  into  the  tuples  of  relation  Domains.  The 
order  of  indices  in  domains’  declaration  is  determined  by  the  order  of  indices  set  in  IndSpace. 
Coordinates  of  the  points  from  index  space  formed  new  rectangular  domain  are  computed  statically. 

Function  DIAS-DOMAIN-EVALUATE  uses  the  representation  in  Domains  for 
unconditional-domain  p  obtained  earlier.  Then  unconditional  domain  p  is  modified  into  diagonal- 
domain. 

Differed  from  functions  RECT-M-DOMAIN-EVALUATE  and  RECT-N-DOMAIN- 
EYALUATE  function  DIAQ-DOMAIN-EVALUATE  puts  oniy  those  vaiues  of  indices  from  domain 
p  into  reiation  Domains  which  satisfy  conditions-on-indices  set  in  declaration  of  diagonal  domain. 

DIAe-DOMAIN-EYALUATE(o)  ~ 

{\fp{p  €  is  -declaration  -of  -  diagonal  -  domainip) 

a3  r/3  v3  q(is  -  name  -  of  -  unconditional  -  domain{p) 

Aq  €  is  -  name  -  of  -  diagonal  -  domain{p)) 

A  V  6  is-  list  -  condition  -  on-domain(p) 

Ap-q^:*u»  I  •v)) 

->  i?  :=  F(Domains(Cond),D  =  u); 

T®q®  F(/?, ReplaceAll(„A,v))  ADD-TO  Domains; 

Delete(p,o)) 

Z  Declaring  the  variables  YAR-DECLARATION  (o). 

Declaration  of  scalar  variables  and  variables  on  domain  are  transformed  into  tuples  of  relation 
Variables.  For  every  variables  on  domain  mentioned  in  the  list  of  iterated  variables  name  of  iteration 
index  is  indicated  as  attribute  Iter. 

VAR-DECLARATION  (o)  ~ 

(V/7(p  €  is  -declaration  -of  -  scalar  -  variablesip) 

aVm(w  e  is  -  name  -of  -  scalar  {p))) 


(Vr/(r/  €  a-header-of  -  iteration{p) 
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a3  r(r  e  is  -  name  -of  -  iteration  -  indexiq) 

Ai/oc^->0°i/°r  ADD-TO  Variables;  Delete(p,o); 

->  00  1/0  A  ADD-TO  Variables;  Deleteip, 6)); 

\lp{p  els-  declaration  -of  -  variables  -on-  domain{p) 

A  Vi/(m  €  Is  -  declaration  -of  -  variables  -  on  -  domain{p) 
aVv3  >v(v  eTs-  name  -  of  -  variable  -on-  domain{u)  a  w  e  is-  name  -  of  -  domaitiju)))) 

{\/q{q  elT-header-of  -iteration{o) 

a3  r(r  e  is-  name  -  of  -  iteration  - index{q) 

Avazq^wovor  ADD-TO  Variables;  Delete(p,o); 

w  o  V  o  A  ADD-TO  Variables-,  Delete(p,o))  ) 

n  Restoring  indices  REBUlLD“INDEX(o) 

In  using  variable  on  domain  the  default  rule  of  setting  index  is  nght  {index  expressions 

coincided  with  the  name  of  index  may  be  omitted).  Function  REBUlLD-INDEX(o)  restores  the 
indices  set  default  in  the  index  expressions  of  the  variables  declared  on  domains  based  on  the 
information  about  these  domains.  The  indices  indicated  as  the  formal  parameters  of  the  parts  or 
function,  actual  parameters  set  on  domain,  elements  of  the  list  of  iterated  variables  arent  to  be 

restored . 

REBUlLD-lNDEX(o)  - 

(  ^PiP  ^  -  c>/  -  variable  -  on  -  doniaiii{o) 

Aq  e  is-  list  -  ind  -  expressionjo) 

'dp*[*q*\<xzo) 

-¥  ReplaceAll{p  ffSUlID(q',  €Er-  INDES^/?))  •  ),<?), 

'^pip  €  is  -  name  -  of  -  variable  -on-  domain{o) 

a-i3  q{(q  e  is  -  formal  -  parameter  -of  -  part(p)  Apccq) 

v{q  e  is-  formal  -  parameter  -of  -  fimction{o)  Ap^q) 

V  {q  €  is  -  initial  -  parameter{o)  a  p*  ON  x:  q) 

'  V  (qr  e  is-  parameter  -  result{o)  a  p*  ON  x  q) 
v(q  e  is-  iterated  -  elementip)  Apccq)) 

RepIaceAll{p,p*{  •  ReplaceAll{o^,^^-  •  ),o)) 

Auxiliary  function  QET-INDEXO')  gives  indices  of  variable  v  in  accordance  with  the 
definition  domain  of  this  variable. 

eET-iNDEX(>’)  ~  7?,  :  =  ¥{}'ariables{D,  X,Iter),  X  =  y); 

R^  ;=  f{Domains{Cond,D,Iname,I),D  =  RfD)); 

(Rflter)  =  0  0  (Iname); 

R^  {Iter)  ^  0  0  {Imme)  °  i?,  {Iter)) 
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Auxiliary  function  RjEfiUILD(^,/7ia/we)  changes  the  list  of  index  expressions  q  to  Iname 
substituting  index  expressions  set  explicitly  from  q  to  Incane  instead  of  corresponding  indices 

(substitution  is  done  by  function  CHANGE). 

REBUlLD(^/waOTe)  ~  ReplaceAll{  o  ,Jncane), 

yp{p  €  is  -  ind  -  expression{q)  ->  QWI€E(//ia/77e,/?)) 


CHAN6E(/na/we^)  ~ 

{3  u3v3  w{is  -  ncane  -  0/  -indexju)  a  is  -  name  -  of -indexjy) 

A/s  -  int  -  constant(w')  a/>=w«=*v«+*w 
Replace{u,v  •  +*w,  Iname); 

3  u3  v3  w(/s  -  name  -  of  -indexju)  a  is  -  name  -  of -indexjv) 

A/s  -  int  -  constant(w)  a  p  =  u»=»v-»w 
->  Replace{u,v  •  -  •  w, Iname); 

3  v3  v3  ^(/s  -  name  -  of  -indexju)  a  is  -  name  -  of  -indexjy)  a//  =  //•=•  v 

Replace  ju,v,  Iname); 

3  n3  v3  wjis  -  name  -  of -indexju)  a  is-  int  -  constant(w')  a/>=//*=*v 

Replace  ju,v,  Iname); 

3  u3  v3  wjis  -  name  -  of -indexju)  a  is  -  int  -  constant(w)  a/7=//«=*v*-*w 

Replace  ju,v  •  -  •  w,  Iname); 

3  u3  v-3  M'(/s  -  name  -  of -indexju)  a  is  -  int  -  constant(M')  a  p=  ii*=*v+*w 
->  Replace  ju,  v  •  +  •  w,  Iname)) 

nComoutina  conditional  domains  COND“DOMAIN"EVALUATE(r>). 

Declarations  of  conditional  domains  are  transformed  into  the  tuples  of  relation  Domains. 
Coordinates  of  the  points  from  index  space  forming  conditional  domain  are  computed  dynamically  in 
the  process  of  AM  machine  working  at  the  expense  of  interpretation. 

COND-DOMAIN-E)^ALUATE(o)  - 
(  ^PiP  €  /s  -  declaration  -of  -  conditional  -  domain  jo) 

a3  u3  v3  ^3  lju  e  is  -  name  -  of  -  conditional  -  domain  jp) 

Av  s  is  -  name  -  of  -conditional  -domain  jp) 

AW  G  is  -  name  -  of  -  domain  jp) 

aI  6  is  -  condition  -  on  -  domain  jp) 

/\p  =  II  •  V  •  w  •  I  •  I )) 

RIname:=  FjDomains,D  =  w)jlname); 

RI:=  FjDomains,D=  h')(/); 

RCond:=  FjDomains,D  =  w)jCond); 

RYes.=  FEPliCL~M)-‘"^^l),RIname,RI); 

RNa=  FEnXM.~m-^^-.jl),l^name,RI); 

RCond  o RYes ®u®  RIname  o  AD D  -  TO  Domains ; 
RCond  o  RNo  0  v  0  RIname  o  /?/  AD  D  -  TO  Domains ; 
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Delete(p,o)) 

Function  REPLACErWD~\tAL(l,RIname,RI)  constructs  the  relation,  which  number  of  tuples 
is  equal  to  the  number  of  the  tuples  in  relation  RI,  and  the  tuples  themselves  are  the  result  of 
substituting  the  values  of  indices  I  instead  of  their  names  Imme  into  /  (the  order  of  substitution  is 
determined  by  the  order  of  the  I  attribute's  values ). 

0  Declaring  incuts  INPUT“DECLARATION(o). 

Declaration  of  input  scalars  and  inputs  on  domain  are  transformed  into  tuples  of  relation 
Input  and  the  tuples  of  relation  Relations. 

lNPUT-DECLARATION(o)  ~ 

n  ;=  get-partname(o); 

is  -  input  -  scalar{o)  a  3  u{is  -  name  -  of  -  scalar  ju)  a  m  «  p)) 

V  ;=  QET-lN-FlLENAME(p); 
uoAov  ADD -TO  Input, 

wo0  0oXoXoMo  AoINPUT  ADD -TO  Relations:, 

Delete(p,o); 

V/7(/7  e  is  -inputs  -  on  -  domainip) 

AVr/3  w{u  eis-  input  -on-  domain{p)  a  w  e  is-  name  -  of  -domain{p))) 

-^v=  QET-IN-FILENAME(p); 

RIname.-  ¥{Domains,D  =  w)(Iname); 

RI:=  ?{Domams,D  =  »■)(/); 

var:-  r/  •  (  •  Replace All(o,„RIname)  •  ); 

Var:=  fEEVCK-m-VNl(Var,RIname,RI); 

Var®  A0  V  ADD  -  TO  Input; 

W0000X0X0  Vcm  0  A  0  V  ADD  -  TO  Relations', 

Delete(p,o)) 

Auxiliary  function  QET-IN-FlLENAME(p)  gives  the  name  of  input  file  of  input  variable  p. 

QET-IN-f  ILEN  AME(/7)  ~  (  3  v(v  €  is  -  name  -  of  -  filejp))  ->  v; 

->  ’norma.dat’) 

Semantics  of  attributes  format  for  input  variables  isn’t  described  in  the  given  specifications  it 
is  equal  to  the  corresponding  semantics  of  the  FORTRAN  language. 

~  Declanna  outputs  OlJTPUT'DECLARATlON(o). 

Declarations  of  output  scalars  and  outputs  on  domain  are  transformed  into  the  tuples  of 
relation  Output. 

OUTPUT-DECLARATlON(o)  ~ 

(  ^  is-  output  -  scalarip)  a  3  u{is  -  name  -  of  -scalarju)  a  m  oc  p)) 

V  :=  SET-OUT-FILENAMEO); 

XomoAov  ADD-TO  Output;  Delete(p,d)\ 

'dpi p  G  is-  outputs  -  on  - domain(o) 
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aV2/3  w(u  e  is-  output  -on-  domain{p)  aw  e  is-  name  -  of  -  domain{p))) 

^  V  ”  6ET-IN-FlLENAME(p); 

RIname:=  F(Domains,D  =  w){Iname); 

RI:=F{Domains,D=w){I); 

var:  =  m  •  ( •  ReplaceAll(°„yRlname)  •  ); 

Far;=  WFlXErM>~"^var,RIname,Riy, 

RCond oVar  ®  K®  V  ADD -TO  Output', 

Deleteip,o)) 

Auxiliary  function  6ET-0UT-FILENAME(p)  gives  the  name  of  output  file  for  variable/?. 

QET-OUT-FILEN  AME(p)  ~  (  3  v(v  G  w  -  name  -  of  -filejp))  v; 

’display’) 

Semantics  of  the  attributes  for  output  variables  in  this  specification  isnl  given  as  it  deals  with 
the  rules  of  representing  output  variables  in  the  external  media  and  is  similar  to  semantics  of 
corresponding  specifications  of  FORTRAN  output  format. 

□  Processing  input  data  files  lNPDT~DATA~f*ROCESSlNS(/^/7g5'). 

The  values  of  input  scalars  and  inputs  on  domain  represented  in  input  files  Files  are 
transformed  into  the  tuples  of  relations  InputData. 

i]JiPU7-DATA-rKGCESSINQ(F/7t;A')  ~  _ 

V/7(/7  G  Files  Aq  gTs-  name  -  of  -  fUejp)  a  Vr(r  g  is  -  element  -  of  -  inpiit^p))) 

->■  (3  v3  m(v  g  Ts  -  tume  -of  -  scalar  jr)  a  u  g  is-arithm-cons\2mt{r)  at  =  v  •  =  •  u) 
-^vouoq  ADD -TO  InputData', 

3  v3  w3  w(v  G  is-name-of  -  variable  -  on  -  domain{r ) 

AU  G  is  -  list  -  range  -of  -  index{r) 

AH'  G  is-data(r) Ar  =  v*(»M*)=»w) 

-^rfl/;  =  MAE-INDEX(v,M);6E3ffiX^  ADD -TO  Value', 

Far  o  Value  ®q  AD  D  -  TO  InputData)) 

Auxiliary  function  MAKE-INDEX(v,r/)  constructs  the  relation  with  each  tuple  equal  to  the 
name  of  input  variable  v  with  index-constants  which  values  are  defined  by  the  list  of  u  indexes 
ranges.  Let  list  u  =  where  //,  =  name-index,  •  =  •  a.  # . 

MAKE-INDEX(v,//)  ~ 

Temp'.^ ®}EBSlE(a,  •  •  6, )  0. . .®  €ENEEfiE(a„  • . .  •  ); 

Temp=ReplaceAll{  o  ,,,Temp)', 

Temp'.=\>  •(®  Temp  0) 

Function  SETNEXT(m')  gives  the  next  element  from  the  list  of  data  w. 

0  Checking  incut  data  CHECK'INPUT“DATA(P). 
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Data  must  be  defined  in  the  input  files  for  every  variable  introduced  during  the  work.  I.e.  tuple 
B  is  said  to  be  in  relation  InputData  for  every  tuple  a  in  relation  Input,  when  the  value  of  p  differs 
from  the  value  of  a  only  by  the  value  of  attribute  Value.  If  this  condition  is  satisfied  the  value  of 
Value  is  taken  from  tuple  p  and  put  into  tuple  a. 

CHECK-1NFUT-DATA(P)  ~  _ 

{lnput{  Value  )  -  InputData{  Value  )=  0 

Va(a  G  InputData  /Ka  =  {var  value  file)  a  P  €  Input  a  p  =  (vor  A  file) 

— >■  p:  =  (var  value  file)); 

-^F); 

□  Designing  the  araoh  of  parts  end  functions’  calls  CAliL“€[RAFH"CREATE(P). 

Directed  graph  of  parts  and  functions’  calls  G=‘(yfil)  consists  of  set  of  nodes  V  and  set  of 
arcs  E.  Set  F  =  {vj  consists  of  the  names  of  parts  and  functions’  called  in  program  P .  If  there  is  call 
from  part  (function)  v.  then  arc  (v,,v^.)  is  included  into  E.  Every  node  v.  contains  supplementary 
information  consists  of  the  list  of  formal  parameters  of  part  v,  (It  may  be  empty).  If  there  is  no 
declaration  of  part  but  there  is  its  call  then  the  error  is  fixed  in  designing  the  graph  and  F  becomes 

the  result  of  function  CALL-€iRAFH-CREATE(P). 

0  Checking  the  araoh  gf  functions  and  parts'  calls  for  being  correct  CHECK-C  ALL-GRAFH(f ). 

The  graph  of  parts  and  functions’  calls  is  correct  if  it  is  acyclic  with  one  node  which  has  no  re¬ 
entrant  arcs  (this  node  corresponds  to  the  call  of  the  main  part  of  program  P  and  further  will  be  called 
entry  node). 

Graph  of  parts  and  functions’  calls  G=(V,E)  is  checked  for  being  correct. 

0  Translating  the  operators  into  relation  Relations  OPERATORS-TO-RELATIONS  (o). 

Operators  ASSUME  are  unrolled.  Operator  specifying  some  rule  of  computation  F  for  the 
variable  of  the  program  in  all  the  points  ^  from  computation  domain  D,  r  =  l,...,|l>j|is 
represented  in  the  form  of  \\D\\  operators.  Each  operator  specifies  the  rule  of  computation  F  in 
particular  point  D,  in  computation  domain  D. 

If  operator  ASSUME  is  set  in  construction  iteration  then  it  provides  with  the  special  mark  in 
the  field  of  attribute  ItVal  =  Hid  •  C  when  it  is  included  into  relation  Relation. 

The  conditions  set  in  the  declaration  of  conditional  computation  domain  D  and  the  condition 
of  the  computation’s  arguments  being  defined  are  taken  into  consideration  in  putting  in  the  value  o 
attribute  Cond  The  value  of  variable  X  with  index  /  is  depended  on  some  arguments,  let’s  name 
them  Arg,  A', X,(/,),  X,  €  T,  /,  g  N.  Then  logical  function  DefiArg)  is 

added  to  condition  Cond.  Function  DefiArg)  will  be  true  (T)  if  all  X,  g  Arg  have  already  defined 

(  i.e.  their  value  is  the  value  of  attribute  Value*  A)  in  other  case  it  will  be  false  (F).  Function 

DeMrs)  is  a  built-in  function  of  AM-  machine  and  is  performed  in  the  process  of  its  work. 

Call  to  the  reduction  functions  and  external  functions  included  into  arithmetical  expressions 
are  eliminated  from  these  arithmetical  expressions  and  are  represented  by  new  scalar  operators 

included^irto  g'j'®^Pg°^2roctlons  fteration  are  processed  and  after  them  the  operators  which  are  not 
included  into  the  iterations. 

0FERAT0RS-T0-RELAT10NS(o)  ~  n  :=  QET-FARTNAME(o); 

(  \/qiq  G  is  -  iteration{o)) 
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->  rrERATI0N*T0-RELAT10NS(w,^);De/e/e(^,o); 

'ip{p&  is -operator  ip)) 

0FERAT0R-T0-REIAT10NS(p,/7,  0,0  yjDelete{q,o)) 

Here  are  auxiliary  functions  performing  analysis  of  construction  iteration  q. 

Function  rrERAT10N-STRUCTURE(^,a,6,c,c/,e,0  for  iteration  q  gives  value  T.  if  a  - 
header  of  iteration,  b  -  boundary  )^lues  of  iteration,  c  -  initial  values  of  iteration,  d  -  body  of  aeration, 
e-  exit  condition  of  iteration,  i  -  index  of  iteration: 

ITERAT10N-STRUCrnJRE(^,  a,  6,  c,  e, /)  ~ 

3a(/5  -  head  -  of  -  iteration(a)) 

A  3  b(is  -  boundary  -  values(i)) 

A  3c(is  -  initial  -  values(c)) 

A  3d  (is  -  body  -  of  -  iteration(i/)) 

A  3e(/5  -  exit  -  conditions(e)) 

A  3i(is  -  name-  index  -  iteration(i)) 

A  ITERATION •  ^ 

Functions  QET-HEAD-ITERATlON(q'),  eET-BOUNDARV(^),  QET-INmAL(^),  QET- 
B0DV(^).  eET-EXIT(^),  QET-ITERAT10N-lNDEX(g')  for  iteration  q  give  correspondingly 
header,  boundary  values,  initial  values,  body,  exit  condition,  index. 

6ET-HEAI>IT£RATI0N(r/)  ~  (ITERATION'STRUCTURECg,  a,  b,  c,  d,  e,  i)  ->  a) 
SET-B0UNDARV(^)  ~  (ITERAT10N-STRUCTURE(^,  a,  b,  c,  d,  e,  i)  b) 
6ET-1NIT1AL(^)  ~  (ITERAT10N"STRUCTURE(^,  a,  b,  c,  d,  e,  i)  c) 

6ET-fiODV(^)  ~  (ITERATION-STRUCTURE(q',  a,  b,  c,  d,  e,  /)  d) 

SET'EXrr(^)  ~  (rrERAT10N-STRUCTURE(^,  a,  b,  c,  d,  e,  i)  e) 
6ET-ITERAT10N-1NDEX(^)  ~  (rrERAT10N-STRUCTURE(^,  a,  b,  c,  d,  e,  i)  i) 
Here  is  the  function  processing  construction  iteration  q. 

ITERATION-TO-RELATIONS(w,  q)  ~ 

a  ;=  QET-HEAD-ITERATION(^); 
b  ;=  SET-B0UNDARV(^); 
c  ;=  QET-INrnAL(^); 
d  :=  QET-BODV(^); 
t;  ;=  eET-EXIT(4/); 
i  ;=  QET-ITERATION-INDEX(^); 

(\/p(p  &  is- operator (b)) 

-)•  IncIude(p,c);Include(p,dy,Delete{p,q)), 

{yp(p  e  is  -  element  -of  -  initiak(c) 

A 3a(is - int - const(a)  a  INITIAL •i-*a»:^p 
A  Vr(r  X  /5  -  operatorip))) 

0PERAT0R-T0-RELAT10NS(/-,  n,  i,  a)); 


Delete(c,  q); 
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(Vr(r  X  /5  -  operator(d))) 

0FERAT0R-T(HlELAT10NS(r,  n,  i,  1)); 
EXrr-T0-RELAT10NS(e,  n,  i,  1)) 

Function  OPERATOR-TOHRELATIONS(p,  n,  i,  a)  puts  the  tuples  into  relation  Relation.  The  form 
of  the  tuples  is  determined  by  the  type  of  operator  p  set  in  the  part  with  name  w  (/  -  index  of  iteration 
or  0 ,  a  -  the  value  of  iteration  index  or  0). 

0FERAT0R-T0-RELAT10NS(p,  n,  i,  a)  - 

(  is-operator-ASSUME(p)  ^  ADD-ASSUME(p,  n,  i,  a); 
is-scaIar-operator(p)  — >  ADD*SCALAR(p,  n,  i,  a); 
is-call-of-part(p)  ->  ADD-COMPUTE(p,  n,  i,  a)) 

Function  ADD-ASSUME(p,  n,  i,  a)  converts  operator  ASSUME  p  into  the  tuples  of  relation 
Relations.  Operator  ASSUME  p  is  set  in  the  part  with  name  n  (/  -  index  of  iteration  or  0 ,  a  -  the 
value  of  iteration  index  or  0). 

ADD-ASSUME(p,  n,  i,  a)  ~ 

d  :=  QET-DOMAlN-NAME(p); 

Rcond  :=  ^iDomains,  D  -  d){Cond)-, 

RIname  :=  ^{Domains,  D  =  d)(Iname); 

R1  :=  ^{Domains,  D  =  d)(I); 

RUnroIl  :=  RE?LACE'IND-VAL(/7,  RIname,  RI); 

RUnroll  ;=  EXCLUDE-FUNCTION(Rf///w//,  n,  i,  a), 

RLeft  :=  GET-LEFT-PART(RC/«ro//); 

RRight  :=  GET-RIGHT-PART(R(/«ro//); 

RVar  :=  GET-VAR(i?Le^); 

{3z(is  -  relation(r)  Azxp)-^  RFunc :  =  RRight); 

3z(/5  -  call  -  of  -  part(r)  Azxp)^  RFunc :  =  RUnroll); 

RDep  :=  Def(  o  GET-DEPENDENCE(i?/?/g/»0  ° ); 

RLambda  :=  RVar;  (Vc(c  e  RLambda)  -^\i?DAlE{c,A,RVar)); 
n  o  ia  °  RDep  o  RCond  o  RVar  o  RLambda  <>  RFunc  AD  D_  TO  Relations 

Function  ADD-COMPUTE(p,  n,  i,  a)  converts  operator  COMPUTE  p  into  the  tuples  of 
relation  Relations.  Operator  COMPUTE  p  is  set  in  the  part  with  name  n  (/  -  index  of  iteration  or  0 , 
a  -  the  value  of  iteration  index  or  0). 

ADD-C0MPUTE(/7,  n,  i,  a)  ~ 

q  ■-  EXCLUDE-FUNCnON(/;,  n,  i,  a); 

I  ;=  SET-LEFT-PART(^); 
r  :=  QET-RlSffr-PART(^); 

RVar  ;=  6ET-VAR(/,  0, 0  ); 

RFunc  :=  q, 

RDep  ;=  Def(  o  SET-DEPENDENCE(r)  o  ); 

RLambda  :=  Rvar;  (Vc(c  e  RLambda) -^\i?DME{c,A,RVar)); 
n o ia o  RDep oTo RVar  o RLambda^  RFunc  ADD_TO  Relations 
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Function  ADD-SCALAR(p,  n,  i,  a)  converts  scalar  operator/?  into  the  tuples  of  relation  Relations. 
Scalar  operator  p  is  set  in  the  part  with  name  n  (/  -  index  of  iteration  or  0 ,  a  -  the  value  of  iteration 
index  or  0). 

ADD-SCALAR(/?,  n,  i,  a)  ~ 

q  :=  EXCLUDE-FUNCnONO?,  w,  /,  a); 

/  :=  eEr-LEFT-RART(e); 
r  ;=  eEr-R16HT-PART(^); 

RDep  ;=  Def(  o  6£T-DEPENDENCE(r)  o  ); 
noiaoRDepoT oloKor  ADD_TO  Relations 

Function  EXIT-T0-RELAT10NS(e,  n,  /,  a)  converts  operator  of  iteration’s  exit  condition  e  into  the 
tuples  of  relation  Relations.  Operator  of  iteration’s  exit  condition  e  is  set  in  the  part  with  name  n  {i  - 
index  of  iteration  or  0 ,  a  -  the  value  of  iteration  index  or  0). 

EXIT-T0-RELAT10NS(c,  n,  i,  a)  ~ 

q  :=  EXCLUDE-FUNCnON(e,  n,  i,  a); 

RDep  ;=  Def(  o  SET-DEPENDENCE(^)  o  ); 

n°ia  °  RDep  oq°0°0°0  AD  D_  TO  Relations 

Function  €lET-DOMAIN-NAME(/?)  gives  the  name  to  the  domain  of  operator  ASSUME/?. 

QET-DOMAIH-NAME(/?)  ~  {^n{is  -  name  -  of  -domain(n)  a  FO  R  •  ??  •  ASSUME  x  />  ->  /;) 

Function  QET-LEFTHPART(/?)  gives  the  left  part  of  p  operator’s  relation  or  the  list  of  actual 
parameters-results  placed  after  key-word  RESULTS  of  parts  call  COMPUTE  in  operator/?. 

QET-LEFT-PART(/?)  ~ 

(  3n3m3s3t(is  -  name  -  of  -domain(n)  /\is-  name  -  of -simple  -  part  {m) 

A  is-  WsX- initial  -  parameter  (s) 

A  is-  list  -  parameter  -  result{t ) 

A  FOR  •  ?/  •  ASSUME  COMPUTE  •  /w  •  (  •  s  •  RESULTS  •/•)  =  /?) 

3m3s3t{is  -  name  -  of  -  simple  -  part  jm) 

A  is  -  list  -/?//7/a/  -  parameter's) 

A  is-  list  -  parameter  -  result  {t ) 

A  COMPUTE  ./?;.(•  i  •  RESULTS  •t*)  =  p) 

3n3m3a{is  -  name  -  of  -  domain(n)  a  is  -  arithm  -  expression(fl) 

A  is  -  variable  -  on  -  domain(/??) 

A  FOR  • /?  •  ASSUME  • /w  • = •  a  = /?) 

->»r, 

3m3a{is  -  arithm  -  expression{a)  a  is  -  name  -  of- scalar  (m)  Am»=*a  =  p) 

->/n) 
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Function  QET*RlQHT-PART(p)  gives  the  right  part  of  p  operator’s  relation  or  the  list  of 
actual  initial  parameters  placed  before  key-word  RESULTS  of  part's  call  COMPUTE  in  operator/). 

6ET-RIQHT-{»ART(p)  ~ 

(  3rBm3s3t{is  -  name  -  of  -domain(n)  a  is-  name  -  of  -  simple  -  part{m) 

A  is-  list  -/w7/a/  - parameter{s) 

A  is  -  list  -  parameter  -resultit) 

A  FOR  .  n  •  ASSUME  COMPUTE  •  m  •  ( •  ^  •  RESULTS  •t*)  =  p) 

3»i3j3/(/5  -  name  -  of-  simple  -  part(m) 

A  is-  list  -  initial  -  parameter's) 

A  is-  list  -  parameter  -result(t) 

aCOMPUTE*w(. 5. RESULTS.  f)  =  p) 

3n3m3a{is  -  name  -  of  -domain(n)  a  is-  arithm  -  expression(a) 

A  is  -  variable  -  on  -  doniain(w) 

A  FOR  •  n  •  ASSUME  •m»=*a  =  p) 

—>a; 

3m3a{is  -  arithm  -  expression(a)  a  is-  name  -  of  -  scalar  im)  Am*=»a  =  p) 

->a) 

Function  EXCLUI>E-FUUCTION(m,  n,  /,  a)  converts  calls  to  the  reduction  functions  and 
external  functions  set  in  operator  u  from  the  part  with  name  n  (/  -  index  of  iteration,  a  -  the  value  of 
iteration  index). 

EXCLUDE-FUNCnON(M,  n,  i,  a)  ~ 

(  y/(./  €  is  -  call  -  to  -  reduction  -  fitnciion(  ti)) 

->/  :=  unameiP),  Replacef,  t,  t/);  ADD-REDUCTHFUNC(/,/,  n,  i,  a); 
V/(/  e  is -call -to- external  -  ftinction{u)) 

->r  ;=  uname{P);  Replaceif,  t, ;/);  ADD-EXTERN-FUNC(/,./;  n,  i,  a)) 
ADD-REDUCT-FUNC(/,  v,  n,  i,  a)  ~ 

(  3<7353a  (is  -  name-  reduction  -  fimction(q)  a  is-  name  -  of  -domain(s) 

A  is  -  arithm  -  expression{a)  a  c/  •  ((  •  5  •  )  •  /'  •  )  =  v) 

-^Rlname  :=  ^{Domains,  D  =  s)(lname): 

RI  ;=  ^{Domains,  D  =  5)(/); 

SumUnroll  ;=  REPLi\CE-IND-VAL(a,  RIname,  RT); 

SumUnroll  ;=  EXCLUDE-FUNCTION(5//w67?/-o//,  //,  /,  a); 
RSumDep  ;=  Def(o  GET-DEPENDENCE(iS«/nf//?ro//)o ); 

SumDep  :=  REL-IN-LIST(/«’M/nDe/)); 
noiaoSumDepoToto Aov  ADD_TO  Relations  ) 

Auxiliary  functior»  REL-iR-LIST(i25r/wjZ)ep)  represents  the  tuples  of  relation  RSumDep  in 
the  form  of  the  list  of  tnple&enamurated  by  comma. 
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ADD-EXTEMJ-nJNC(r,  v,  n,  i,a)~ 

(  3q3s(is  -  name-  extern  -  Junctionjq)  a  is  -  list  -  initial  -  parameter's) 

Ag»(*5«)  =  v) 

ExtDep  ;=  Def(oGET-DEPENDENCE(5)o); 
n °ia ° ExtDep oT ° t o Aov  ADD_TO  Relations  ) 

Function  6ET-VAR(i),/  -  variable  on  domain  with  indices-constants  or  Wst-parameter-result 
with  indices  constants  creates  the  list  of  variables  on  domain  with  indices  constants  where  all  the 
variables  set  in  /  are  included. 

QET-VAR(/)  ~ 

(  is-  name  -  of -scalar  (1)  /; 

is  -  variable-on-domainij)  /; 

3p3qiis  -  nnmp-  variahle  -on-  domain!  p)  a  is-  domain  -  of  -  parameter  (q) 
aI  =  p*  on  •  q) 

-»UST-VAR  (p,  q); 

3p3q{is  -  iterated  -  variable  -  on  -  domain{p)  a  is-  domain  -of  -  parameter  {q) 

A  /  =  p  •  ON  •  q) 

-^LIST-VAR(p,^)) 

Function  UST-VAR  ip,  q)  creates  auxiliary  relation  Temp(Cond,D,InameJ),  which  tuples 
set  the  domain  of  parameter  q  and  generate  the  list  of  variables  with  namep  and  indices  from  relation 
Temp. 

UST-VAR  (p,^)~ 

(  is  -  name  -of  -  unconditional  -  domain(q) 

-¥  Temp.=  F{Domains,D  =  q); Temp{I); 

(Vc(c  e  Temp)  q*{  Replace All{°„,c)  •  )); 

3  .r3  >'3  a{is  -  name  -  of  -  unconditional  -domain{x)  a  is  -  name  -of  -  indexjy) 

Ais  -  int-  constan  t(a)  Aq  =  x*l*y*  =  *  a) 

Temp.=  ?{Domains,D  =  q  a  Iname  =  a  /  =  a);  Temp{iy, 

(Vc(c  e  Temp)  ->q*{  Replace All{o„,c)  •  )); 

3  .y3  z(js  -  name  -  of  -  unconditional  -  domain(x)  Ais-  list  -  index  -  expression{z) 

A^  =  Y  •  /  {  •  r  •  )) 

( V v’Cv  €  Is  -  name  -  of  -  index(z)  a  3  a{is  -  int-  constan  t(t7)  a  v  •  =  •  a  oc ::)) 

_>  Temp:  =  ¥{Domains,  D  =  q  a  Iname  =  y  a  1  =  a);  Tempi  I); 

(Vc(c  €  Temp)  -^q*{  ReplaceAllio„,c)  •  )).) 

Function  QET-DEFENDENCE(/),  /  -  WsX-initial-parameter  or  arithmetical  expression 
creates  the  list  of  arguments  Arg  for  function  of  i4A/-machine  DefiArg)  where  all  the  variables  set  in 
/  are  included  in  rraatinn  of  De/ attribute’s  value  from  relation  Relations  the  condition  of  the 
definite  determination  of  the  computation’s  arguments  is  considered  and  logical  function  DefiArg))  is 
added.  Function  will  be  true  (T)  if  all  X,  €  Arg  have  already  defined  ( i.e.  their  value  is 
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the  value  of  attribute  Value  ;t  A)  in  other  case  it  will  be  false  (F).  Function  Def^Arg)  is  a  built-in 
function  ol  AM-  machine  and  is  performed  in  the  process  of  its  work. 

6ET-DEFENDENCE(/)  ~ 

(/5  -  arithm  -  expression(/) 

->  (Vr(r  &Ts-  variable  -on-  domam{[)  vre  is-  name  -  of  -  scalar  ij))  ->  r); 

3p3q(is  -  name-  variable  -on-  domaini  p)  a  is  -  domain  -of  -  parameter  iq) 

A  p»  ON  •qx I 

-^UST-VAR(p,q); 

3p3qiis  -  iterated  -  variable  -  on  -  domain{p)  a  is  -  domain  -  of  -  parameter  {q) 

A  p*  ON  •q^l 

-^LlST-VAR(p,gr)) 

2.8  The  rules  of  ^4A/-machine  work 

Re\3lC\ouARelations{Name,ItVal,Def  Cond,  Var,  Value,  Fund),  ARelations  ^  Relations  wiil  be 
named  active  relation  of.i4M-machine. 

Transfer  of  the  tuples  from  relation  Relations  to  ARelations  is  carried  out  by  built-in 
functions  of  .;4M-machine 

CALLrPAST{(p{actual-parameters))p-  name  of  part  and 
NEXT-ITERATION (n •i),n-  name  of  iteration  index,  i  -  value  of  iteration  index. 

Function  CAlLr^AfUiipiactual-parameters))  invokes  the  tuples  of  relation  Relations 
corresponding  to  computations  of  part  p. 

Function  NEXT-ITERATION  (/?•/)  invokes  the  tuples  of  relation  Relations  corresponding 

to  the  next  iteration  step  /+!  in  iterative  computation  by  iteration  index  n. 

Definition  of  these  functions  are  given  below. 

Interpretation  of  abstract  program  A{P)  by  abstract  machine  AM  consists  of  the  initialization 
and  stages  of  interpretation. 

1.  Initialization. 

1)  Function  CHECK-REASSI6NMENT(/?e/aftow5)  is  carried  out.  It  checks  the  condition  of 
single-assigning  the  value  to  the  variable  of  program  P. 

2)  Function  CALL-PART(777a/>7)  is  done.  Main  -  name  of  entry  node  of  the  calls’  graph 
G(F^. 

2.  Stages  of  interpretation. 

1)  Set  ArgDefined  is  chosen  from  relation  Relation.  ArgDe fined  is  a  set  of  tuples  which 
have  all  the  arguments  computed  but  the  values  of  the  variables  haven’t  computed  yet.  The  tuples 
from  ArgDefined  which  computation  depends  on  the  condition  Cond  and  Cond~¥  are  eliminated 

from  ArgDefined  and  ARelations. 

ArgDefined  :=  ^{ARelations,  Def=  T  a  Value  =  A ); 

ArgDefined  ;=  ArgDefined  -  ^{ArgDefined,  Cond=  F); 

ARelations  ;=  ARelations  -  ^{ARelations,  Cond  =  F) 

2)  Arbitrary  subset  NonDeterm  c  ArgDefined  is  chosen.  Arbitrariness  of  such  a  choice  is 

determined  by  non-determination  of  /4A/-machine  and  allows  to  specify  not  the  only  process  of 
program  P  fulfillment  but  the  class  of  all  possible  computations. 
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NonDetermQ  ArgDefined 

3)  Function  June  is  computed  for  every  tuple 
{name  o  itval  °  def  ®  cond  ®  vqt  o  value  o  Junc^  from  NonDetemt.  This  function  is  set  in  the  field  of 

attribute  Func  of  the  same  tuple.  Besides  some  variants  are  possible: 

a)  func  -  call  of  simple  part  nameiactual-parameters).  The  following  function  of  AM- 
machine  is  carried  out  in  this  case: 

CfdlrfhKl{name{actual-parameters)) 

b) Jiinc  -  call  of  external  function  nameiactual-parameters).  The  following  function  of  AM- 
machine  is  carried  out  in  this  case: 

QMlr^MO[{nameiactual-parameters  RESULT  name)) 

c) func  -  arithmetical  expression,  call  of  reduction  function  or  operator  INPUT. 

Arithmetical  expression  and  reduction  function  are  computed  (  semantics  of  reduction 

functions  coincides  with  common  mathematical  definition  of  these  functions),  operator  INPUT  is 
carried  out  and  obtained  value  val  substitutes  value  value  =  A  of  considered  tuple.  Besides  if  there 

is  tuple  {cond  o  var  o  value^  o  Jtle^  in  relation  Output  then  the  value  value^  =  A  of  this  tuple  is 

substituted  to  val. 

Formal  definition  of  computation  from  NonDeterm: 

COMPUTE(AbwDe/erm); 

COMPUTE(//o;7c/c/erw)  ~ 

(  nondeterm  =  {name  o  itval  o  def  o  cond  «  var  o  value  o  fine) 

-^i  is- call  -of  -  partifunc) 

CALL-PART(/Mnc);  Deleteinondeterm,  NonDeterm)-, 

3y3/(75  -  name- external  -  functionjf) 

A  is-  list  -  initial  -  parameteril)  a/*(»/*)  =  fime) 

CALL-PART(  /•(•/•  RESULTS  •/•));  Deleteinondeterm, 

NonDeterm); 

is  -  arithm  -  expression(^/wc) 

-rvalue  :=  EVALUATE-EXFRO^/wc);  WDATEiOutput,  value); 
is  -  call  -  to  -  reduction  -  function(^/wc) 

-rvalue  :=  REAUZE(/rwc);  UPDATE(0;//;7?//,  value); 

//wc  =  INPUT 

-rvalue  ;=  lNPUT(//7/)uf,  Var);  VPDAlEiOutput,  value))) 

Specification  of  function  M^UhlEiOutput,  value)  and  lNPUT(/;7/7r/r,va7-)  is  evident  enough 
and  accurate  definition  of  function  REALIZE(^//ic)  is  determined  by  realization  of  ^-machine. 

4)  Set  OutDefined  is  chosen  from  relation  Output.  OutDefined  is  a  set  of  tuples  which  have 
all  the  arguments  computed.  The  tuples  from  Out  Defined  which  computation  depends  on  the 
condition  Condan6  Condp¥  are  eliminated  from  OutDefined. 


OutDefined ;=  fiOutput,  Value *K); 
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OutDefined  :=  OutDefined-  '^{Output,  Cond-  F); 

5)  Arbitrary  subset  NonDetermOiit  c  OutDefined  is  chosen.  The  value  value  is  output 
into  file^r/e  for  every  tuple  {cond  «  x  o  /  o  value  o  file)  from  NonDetermOut. 

NonDetermOut  c  OutDefined 
OutDefined  ;=  OutDefined  -NonDetermOut 
OJJTVimiNonDetermOut) 

6)  Iterative  computations  set  by  the  tuples  of  relation  ARelations  for  the  next  values  of 
iteration  indices  (if  the  iterative  computations  aren’t  finished  in  accordance  with  the  condition  of  an 
exit  from  the  iteration)  are  invoked. 

Activlter  ;=  P{ARelations,  ItVal  ^0  0); 

Activlterindex  :=  TT-lNY)i,X(ActivIteriItVar)); 
NlXT-TTETtATLONiActivIterlndex) 

7)  Condition  CHECK-STOP  of  AW-machine  stop  is  checked. 

If  all  the  necessary  computations  are  finished  and  the  values  of  all  the  variables  which  are  to 
be  computed  have  been  computed  then  AW-machine  finishes  working  it  NORMA’Iiy  and  passes  over 

to  state  stop.  , 

If  there  are  variables  which  values  are  to  be  computed  but  new  computations  aren  t  possible 

then  A/V/-machine  finishes  working  it  AbNORMA’Iiy  and  passes  over  to  state  error. 

In  other  case  the  next  stage  of  iteration  is  carried  out. 


CHECK-STOP  ~ 

Activlter  =  F{  ARelations, ItVal  ^  0  0); 

{  ?{ActivIter,Var  =  0){Cond)  =  T  a  ARelationsiyalue)  *  A 

stop; 

f (Activlter, Var  =  0){Cond)  =  T a 3  a(a  €  ARelations{Value) /\a^  A 

error; 

->  nextAMstep) 

^  Checking  the  condition  of  sinale-assianment  CHECK-REASSIQNMENT(/?t?/ar/o/?j). 

If  there  exist  even  two  coincided  variables  which  values  are  to  be  computed  and  the 
conditions  of  theses  variables’  computation  don’t  come  into  the  conflict  then  the  condition  of  single¬ 
assignment  is  broken. 

CHECK-R£ASSIQNMENT(i?i?/a//o/w)  ~ 

(3  a, 3  a, (a,. a,  e  Relations 
A  a,  -  name ^  oitval^  °def^  °cond^  o  var,  o  va/r/e,  o  func^  i 

Atti  =\name2  °itval2  ^defi  cond 2  «>  var^ »  va/rrr;2  ' 

A  var,  n  var,  ^  0 
A  (cowrf,  A  -icond2 )  =  F) 

Aitvalj  =itval2 

^F) 
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□  Invoking  the  tuples  while  calling  the  osrt  CALL~PART(c). 

Correspondence  of  actual  and  formal  parameters  of  c  part’s  call  is  set  and  the  actual 
parameters  are  passed  to  the  tuples  of  the  part.  The  tuples  setting  iterative  computations  are 
initialized  and  all  the  tuples  are  transfered  into  relation  ARelatiom  (with  the  renaming  of  the  local 
variables). 

CALL-PART(c)  ~ 

(3/73  qiis  -  name  -  of  -  pari(j>)  a  is  -  actual  -  parameters{p)  Ap*{»q»)  =  c) 

->  RPart:=  ?{Relations,Name  =  /?); 

/=QET-FORMAL(^); 

RActul=  ACTUAL-FORMAL(i?Parf/^,^); 
i?Reflify;=lNI-rTi:RATlON(/M//); 

RENAME-IN-CALl(iy?ear^)  ADD-TO  ARelations) 

Function  6ETHF0RMAL(p)  analyses  the  graph  of  calls  G(F,£)  and  give  the  formal 
parameters  of  part  p. 

Function  ACTUAL-FORMAL(i?Parf/^,^)  carries  out  passing  actual  parameters /to  the 
tuples  of  relation  Rpart  from  the  part  with  name  p  and  formal  parameters  q. 

ACTUAL-FORMAL(i?Parr/^,^);  ~ 

CHECK-FORMAL-ACTUAL(/;^); 

(3  g3h3  s(is  -  list  -  name(g)  a  is-  list  -  name{h) 

A^.  RESULTS. /?  =  / 

A/s  -  list  -  initial  -  parameter{r)  a  is-  list  -  parameter  -  result{s) 

Ar  •  RESULTS* s=^ 

->  Rpart  :=  MAKE-IN(p^arf^,r);  Rpart  :=  MAKErOVT(pJiPartM) 

Function  MAKE-lN(p^arf^/)  chooses  the  next  elements  from  the  lists  of  formal  initial 
parameters  g  and  actual  initial  parameters  r. 

UAKErm(pJiPart^,r)  ~ 

(/s-name(^)  ^  MAKE-IN-PARAMETER  {pfiPart^,r,y, 

/5-list-name(^)  ->  MAKE-IN-PARAMErER(p^a/y,//eai/(^»a4r)); 

UhKErV^i(pMartJail{g)Jail{r))^ 

Function  MAKE-Oirr(/?^ar/,g,r)  chooses  the  next  elements  from  the  lists  of  formal 
parameters  results  g  and  actual  parameters  results  r . 

MAKE-OUr(/7^a/7,^,r)  ~ 

(/s-name(g)  ->  MAKE-OUT-PARAMETER  (pJiPart^^,); 

/s-list-name(g)  ->  MAKE-OUT-PARAMETER  {pflPartfiead{g)fiead{r)y, 

MAKE-OUT  (pJlPart,tail(g),tail{r)y) 

Function  MAKE-IN-PARAMETER(p^arr^/,)  modifies  the  tuples  of  relation  Rpart 
from  the  part  with  name  p  while  substituting  initial  formal  parameter  g  to  initial  actual  parameter  r. 
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MAKE-IN-PARAMETER(p^art^,r,)  ~ 

{is-  arithm  -  expression(r) 

val  =  BfiaiSE-  ESP8^) /?  o  T  o  T  o  g  o  va/  o  va/  AD  D  -  T O  RPart; 
is  -  name  -of  -  external  -  simple  -  part(r )  — >  Replace  Ailing, r,  RPart), 
is  -  name  -of  -  external  -  Junction(r )  ReplaceAll{^g,r  ,RPart), 

3  w3  ^(is  -  name  -  of  -  vzx  table  -  on  -  domain{n)  a  is-  domian  -  of  -  parameter{^) 

/\r  =  n*  ON •  q) 

->  (3  x3  y(is  -  name  -  of  -  unconditional  -  domain{x)  a  is-  list  -  ind  -  expression{y) 
Aq-x*l*  y)) 

namey=  uname{P),Replace{q,name^,ry,new -domain(riame^,x,yy, 

name{=  uname{P); 

/nc/Mc/e(VARIABLE  •  name^  •  DEFINED  ON  •  name■^  •  .,p); 
VAR-DECLARATION(p); 

Include(FOR  •  name^  •  ASSUME  •  name^  •  =  /?•[•;/•]  .,p); 
Include(¥OR  •  mwne,  •  ASSUME  •  g*  =  name^  •  .,p); 
REBUILD-INDEX(p);  OPERATORS-TO-RELATIONSO?); 

is  -  name  -  of  -  unconditional  -  domam{q) 

name{.=  tiname{P); 

Include(V ARIARLE •  name^  •  DEFINED  ON  •q*  .,p); 
VAR-DECLARATlON(/?); 

Include{¥OR  *  q*  ASSUME  •  namCj  •  =  n  •  .,py, 

Include{¥OR  •  q*  ASSUME  •  g •  =  name,  •  .,py, 

REBUlLD-INDEX(p);  0PERAT0RS-T0-RELAT10NS(p))  ) 

Function  NEW’-DOMA1N(/7;cj’)  builds  new  domain  with  name  n  from  the  domain  with  name 
xand  list  of  modifications y.  This  function  is  an  analogous  of  the  superposition  of  functions  RECT-N- 
DOMAlN-TRAJNSFORM(o)  and  RECT-N“DOMA1N-EVALUATE(o)  defined  above. 

Function  MAKE-OUT-PARAMETER(pJ^arf,g,r)  modifies  the  tuples  of  relation  Rpart 
from  the  part  with  name  p  while  substituting  formal  parameter-result  g  to  actual  parameter-result  r. 

MAKE-OUT-PARAMETER  ipJiPart,g,r,)  ~ 

{is-  name  -of  -  scalar{r)  — >  po'Y  o'T  °  g°  val  oval  AD  D  -  TO  RPart; 

3  //3  q{is  -  name  -of  -  variable  -  on  -  domain{n)  a  is  -  domian  -of  -  parametericf) 

Af  =  n  •  ON  •  q) 

->  (3  x3  v(^'5  -  name  -  of  -  unconditional  -  domainfx)  a  is  -  list  -  ind  -  expression{y) 
r^q  =  x*l*  y)) 

->  name{.-  imame{P);Replace(q,name^,ry,new -domain{name^,x,y); 

name{=  uname{P); 

/«c/7/£/e(  VARIABLE  •  ncnne^  •  DEFINED  ON  •  ncane^  •  .,p)i 
VAR-DECLARAT10N(p); 

/;7c/7/c/?(FOR  •  name^  •  ASSUME.//  •  =  name,  •  .,py. 
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/«c/w£fe(FOR  •  name^  •  ASSUME  •  name^  •  =  •  g  •  ^p)', 

REBU1LD-INDEX(/7);  OFERATORS-TO-RELATIONS(p); 
is  -  name  -  of  -  unconditional  -  domain((f) 

name^  =  uname(P); 

IncludeiVARlABLE  •  name,  •  DEFINED  ON  •  ^  •  .,p); 
VAR-DECLARAT10N(p); 

lnclude(FOR  •q*  ASSUME •  « •  =  name,  •  .,/>); 

Include(EOR  •q*  ASSUME •  name,  •  =  g •  .,py, 

REBUlLD-INDEX(p);  0FERAT0RS-T0-RELAT10NS(p)) ) 

n  Invoking  tuples  at  the  next  iteration  step  NEXT-1T£RAT10N(w). 

If  the  next  iteration  step  on  iteration  index  n  is  terminated  and  the  exit  condition  of  iteration 
hasn’t  been  performed  then  the  tuples  are  invoked  for  the  next  iteration  step. 

NEXT-ITERATION(w)  ~ 

OneIter=  ?{ARelatiom,ltVal  =  «); 

(FiOneIter,Var  =  0)iCond)  =  F  a  F(OneIter,Var  ^  0)(yalue)  A 
->  QET-NEXT-ITERATlON(w); 
_>RESULT-ITERAT10N(n,wiar(0we//er(/fFa/)))) 

The  tuples  specifying  the  body  of  iteration  are  chosen  from  relation  Relations. 

QET-NEXT-ITERAT10N(/7)  - 
Nextiter:  =  ^{Relations,  ItVal  =  n*\); 

( Vc(c  €  Nextiter  a  c  =  {mme  o  itval »  def  <>  cond  o  varo  value  o  June) 

-)•  [i?DME{c,{nameonoi  +  lodef  o  cond  o  varo  valuer  June),  Nextiter); 
lNMTERATION(Aexf/fer)  ADD-TO  ARelations) 

Function  INl-ITERATION(i?)  initializes  iterative  computations  set  by  the  tuples  of  relation  R 
INHTERAT10N(i?)  ~ 

( Vc(c  e  i?  A  c  =  {name  o  itval  <>  def  <>  cond  »  var®  valtte  <=  fine)  a  itval  =00 

( Vc(c  €  RAll  Ac  =  {name  o  in'al  o  def  o  cond  o  varo  value  <=  finic)  a  it\'al  =  i  •  a 
->  ReplaceAll{i,a,c)) 

Function  RESULT-lTERAT10N(7/,a))  substitutes  the  name  of  iteration  index  n  to  its 
maximum  value  a  in  all  the  tuples  of  relation  ARelation  which  don’t  set  iterative  computations. 
Passing  the  results  of  iterative  computations  is  done  in  this  way. 

There  is  auxiliary  function  U-INDEX  (n  •  /)  ~  w . 
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2.9  The  example  of  y4A/-machine  work. 

Consider  program  P  where  the  principal  constructions  of  the  NORMA  language  are  given.  Note  that 
this  program  is  demonstrative  and  computational  formulae  used  in  the  program  dont  essentially 
describe  any  known  numerical  method. 


MAIN  PART  One. 

BEGIN 

Oil:(OD;On). 

Oii;{  Oi:(i=l..n-l):  O|:0=T.n-l) ). 

Oii:Oj/j=l ..n-2.  On:OI/OI-RIGHT(n-3). 

INDEX  i,j. 

DOMAIN  PARAMETERS  n  =  4. 

VARIABLE  aJJs Jn  DEFINED  ON  Olj.  VARIABLE  x.y  DEFINED  ON  Ol. 
VARIABLE  Cond  DEFINED  ON  Oji.  VARIABLE  b  DEFINED  ON  O]. 

OjiYES  ,  OjiNOT ;  Oji/  Ts  <  0.5. 

INPUT  a(FILE=’data.dat')  ON  Olj.  b(FILE='data.dat')  ON  Oj. 
INPUT  T  ON  Olj/l=l.i=2..n-1,  T  ON  Olj/i=l,l=2..n-l. 

OUTPUT  T(FILE=’results')  ON  Olj/l=n-T,|=n-l. 

OUTPUT  x(FILE='results’)  ON  Ol. 

FOR  Ol  ASSUME  x  =  y  +  SUM((Oj)a*b);  y  =  bG=ll. 

FOR  OI]/l=2..n-l ,i=2..n-l  ASSUME  T  =  Ttl-11+Tl|-ll;  Ts  =  SINP). 

FOR  03/1=1, 1=1. .n-1  ASSUME  Ts  =  SIN(l+i). 

FOR  0IJ/j=1  ,l=2..n-l  ASSUME  Ts  =  SIN(l-j). 

FOR  01  ASSUME  COMPUTE  Two(Ts  ON  Oli/j=l  RESULT  Tn  ON  Oi|/l=l). 

OUTPUT  Tn(FILE='results')  ON  Ol],  Cond  ON  OjIYES. 

FOR  OjiYES  ASSUME  Cond  =  Tn. 

FOR  OjiNOT  ASSUME  Cond  =  Ts. 

END  PART. 

PART  Two.  Ts  RESULT  Tn 
BEGIN 

OI:(i=1..M).  DOMAIN  PARAMETERS  M  =  3. 

VARIABLE  Ts,Tn,T  DEFINED  ON  01.  VARIABLE  Eps. 

INPUT  Eps.  INPUT  T(FILE='data.dat’)  ON  Ol. 

ITERATION  Tn  ON  N. 

INITIAL  N  =  0  : 

FOR  01  ASSUME  Tn  =  Ts. 

END  INITIAL 

FOR  01  ASSUME  Tn  =  Tn[N-1]/M. 

EXIT  WHEN  ABS(Tntl=3,Nl)  <  Eps. 

END  ITERATION  N. 

END  PART. 


Input  file  data.dat: 


bG=1..2) 

a{i=1..3,j=1..3) 


T(l=1..3) 

ba=3) 


5.0,  7.0: 

2.0,  3.0,  -4.0, 
3.0,  4.0,  -1.0, 
3(5.0); 
3(-l.l): 

?.0; 


Input  file  .norma.dat: 


Eps 

T(l=2..3,j=1) 

T(i=l,j=2..3) 


=  1.0E-6: 
=  3.3,  4.4: 
=  1.1,  2.2; 


The  result  of  performing  function  LOCAL-VARlABLES-ROAME(o),o=One,Two  -  the 
program  and  input  files  get  the  form  ( we  consider  function  umme{P)  engenders  unique  names  by 
adding  zeroes  as  the  last  identificator’s  symbols). 


MAIN  PART  One. 
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BEGIN 

OjiO:(  OjjO;  OHO ). 

OiiO:(  OiO:(10=l..nO-l);  OiO:00=l..nO-l) ). 

OB0:O]0/i0=l..n0-2.  OnO:OH)/OIO*RIGHT(nO-3). 

INDEX  iO.jO. 

DOMAIN  PARAMETERS  nO  =  4. 

VARIABLE  aO.TOJsOJnO  DEFINED  ON  Ol]0.  VARIABLE  xO.yO  DEFINED  ON  OlO. 
VARIABLE  CondO  DEFINED  ON  OjiO.  VARIABLE  bO  DEFINED  ON  OjO. 

OjiYESO  .  OjiNOTO  :  OjiO/  TsO  <  0.5. 

INPUT  aO(FILE=’dcrta.dat')  ON  OljO,  bO(FILE=’dcrta.dat’)  ON  OjO. 
INPUT  TO  ON  0i|0/l0=l,i0=2..n0-l,  TO  ON  0ij0/|0=l,f0=2..n0*1. 
OUTPUT  TO(FILE='resolts')  ON  0i|0/i0=n0-l,i0=n0-1. 

OUTPUT  xO{FILE='results’)  ON  OlO. 

FOR  OlO  ASSUME  xO  =  yO  +  SUM((O|0)a0n>0);  yO  *  b0D0=l01. 

FOR  OliO/iO=2..nO-l ,]0=2..n0-1  ASSUME  TO  =  TOpO-ll+TOO-Tl:  TsO  =  SIN(T0). 
FOR  OljO/iO=l,iO*T..nO-l  ASSUME  TsO  =  SIN(IO+jO). 

FOR  OljO/jO=l,IO=2..nO-1  ASSUME  TsO  =  SIN(iO-iO). 

FOROiO  ASSUME 

COMPUTE  Two(TsO  ON  0l|0/i0*i0  RESULT  TnO  ON  0ii0/10=i0). 

OUTPUT  TnO(FILE='results’)  ON  OljO,  CondO  ON  OjiYESO. 

FOR  OjiYESO  ASSUME  CondO  =  TnO. 

FOR  OjiNOTO  ASSUME  CondO  =  TsO. 

END  PART. 

PART  Two.  TsOO  RESULT  TnOO 
BEGIN 

OIOO:(IOO=1..MOO).  DOMAIN  PARAMETERS  MOO  =  3. 

VARIABLE  Ts00,Tn00,T00  DEFINED  ON  OlOO.  VARIABLE  EpsOO. 

INPUT  SpsOO.  INPUT  TOO(FILE=’dcita.dat’)  ON  OlOO. 

ITERATION  TnOO  ON  NOO. 

INITIAL  NOO  =  0  : 

FOR  OlOO  ASSUME  TnOO  =  TsOO. 

END  INITIAL 

FOR  OlOO  ASSUME  TnOO  =  TnOOjNOO-lj/MOO. 

EXIT  WHEN  ABSCTn00[l00=3,N001)  <  EpsOO. 

END  ITERATION  NOO. 

END  PART. 


Input  file  data.dat: 


Input  file  norma.dat: 


bOGO=1..2) 

aO{IO=1..3,jO=1..3) 


T00(I00=1..3) 

b000=3) 


5.0,  7.0: 

2.0,  3.0,  *4.0, 
3.0, 4.0.  -1.0. 
3(5.0): 
3(-l.l): 

?.0: 


Eps  =  l.OE-6: 

TO(IO=2..3,jO=1)  =  3.3, 4.4: 
TO(IO=l,jO=2..3)  =  1.1, 2.2: 


The  result  of  caryying  out  function  DEF-INDEX-SPAC£(o),  o  -One,  Two  constructing 


IndSpace  -  {IndSpace^),  i  = 

IndSpace^  =  One  •  /O  •  jQ 
IndSpace^  =  Two  •  /OO 

and  deleting  declaration  INDEX  lO.jO  from  the  text  of  the  program. 

The  result  of  performing  function  FAR'‘DOBlAIN”EXCLUDE(o),o=One,Two: 


MAIN  PART  One. 
BEGIN 
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OiiO:(  OilO;  OHO ). 

OijO:(  OlO:(iO=1..4-1):  OjO:(iO=1..4-l)  ). 

OiiO:OiO/jO=l ..4-2.  OnO:OIO/OiO-RIGHT(4-3). 

VARIABLE  aO.TOJsOJnO  DEFINED  ON  OljO.  VARIABLE  xO,yO  DEFINED  ON  OiO. 

VARIABLE  CondO  DEFINED  ON  OjiO.  VARIABLE  bO  DEFINED  ON  Oj'O. 

OiiYESO  .  OjiNOTO  :  OjiO/  TsO  <  0.5. 

INPUT  aO(FILE='data.deit')  ON  OljO,  bO(FILE=’deita.dat')  ON  OjO. 

INPUT  TO  ON  OijO/iO=1,jO=2..4-1,  TO  ON  01j0/j0=l,i0=2..4-l. 

OUTPUT  TO(FILE='results')  ON  OijO/IO=4-l,JO=4-l. 

OUTPUT  xO(FILE='resotts')  ON  OIO. 

FOR  OIO  ASSUME  xO  =  yO  +  SUM((OjO)aO*bO)-.  yO  =  b0D0=l01. 

FOR  OijO/iO=2..4-T.jO=2..4-1  ASSUME  TO  =  TOtiO-ll+TOO-lj;  TsO  =  SIN{T0). 

FOR  OljO/IO=l.jO=1..4-1  ASSUME  TsO  =  SIN(IO+jO). 

FOR  OljO/jO=l,IO=2..4-l  ASSUME  TsO  =  SIN(IO-jO). 

FOR  OiO  ASSUME 

COMPUTE  Two(TsO  ON  0lj0/j0=!0  RESULT  TnO  ON  OljO/iO=IO). 

OUTPUT  TnO(FILE=’results’)  ON  OljO.  CondO  ON  OjlYESO. 

FOR  OjlYESO  ASSUME  CondO  =  TnO. 

FOR  OjiNOTO  ASSUME  CondO  =  TsO. 

END  PART. 

PART  Tww).  TsOO  RESULT  TnOO 
BEGIN 

0100:000=1. .3). 

VARIABLE  TsOO.TnOO.TOO  DEFINED  ON  OlOO.  VARIABLE  EpsOO. 

INPUT  EpsOO.  INPUT  TOO(FllE=’data.dat')  ON  OiOO. 

ITERATION  TnOO  ON  NOO. 

INITIAL  NOO  =  0 : 

FOR  OiOO  ASSUME  TnOO  =  TsOO. 

END  INITIAL 

FOR  OiOO  ASSUME  TnOO  =  Tn00lN00-l]/3. 

EXIT  WHEN  ABS(Tn00ll00=3,N001)  <  EpsOO. 

END  ITERATION  NOO. 

END  PART. 

The  result  of  performing  function  CONST-EXPR-EVALUATE(o),o=One,Two  (part  Two 
isn’t  changed): 

MAIN  PART  One. 

BEGIN 

OnewO:(CIO=1):aO=2..3)).  Onewl:(GO=1):(IO=2..3)).  Onew2:((IO=3):aO=3)). 
Onew3;((i0=2..3);00=2..3)).  Onew4:((IO=l):00=1..3)).  Onew5:(00=l):(i0=2..3)). 

OjlO:(  a0=l-2);  10=1. .2 ). 

OijO:(  (10=1. .3)r  00=1 .3)  ). 

OiO:(iO=1..3).  OjO:aO=1..3). 

Ojj0:00=1..2).  OliO:(iO=1..2). 

VARIABLE  aOJOJsOJnO  DEFINED  ON  OljO.  VARIABLE  xO,yO  DEFINED  ON  OiO. 

VARIABLE  CondO  DEFINED  ON  OjiO.  VARIABLE  bO  DEFINED  ON  OjO. 

OjlYESO  ,  OjiNOTO :  OjiO/  TsO  <  0.5. 

INPUT  aO(FILE='data.dat')  ON  OljO,  bO(FILE='data.dat’)  ON  OjO. 

INPUT  TO  ON  OnewO,  TO  ON  Onewl. 

OUTPUT  TO(FILE=’results')  ON  Onew2. 

OUTPUT  xO(FILE='resuHs’)  ON  OIO. 

FOR  OiO  ASSUME  xO  =  yO  +  SUM((OjO)aO*bO).  FOR  OIO  ASSUME  yO  =  bODO=IO]. 

FOR  Onew3  ASSUME  TO  =  TOpO-lj+TDO-lj.  FOR  Onew3  ASSUME  TsO  =  SIN(T0). 

FOR  Onew4  ASSUME  TsO  =  SIN(IO+jO).  FOR  OnewS  ASSUME  TsO  =  SIN(IO-jO). 

A'lTUMf 

COMPUTE fwopsCTON  OijO/jO=IO  RESULT  TnO  ON  OijO/iO=IO). 

OUTPUT  TnlJ(HIS=’res«Hb’)  ON  OljO.  CondO  ON  OjlYESO. 

FOR  OiiYESO  ASSWUBE  CondO  =  TnO. 
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FOR  OjiNOTO  ASSUME  CondO  =  TsO. 

END  PART. 

The  result  of  perfotming  functions  RECT“DOMAINHEVALUATE(o),  DIAQ-DOMAIN* 
EVALUATE(o),o=OneJwo  is  filling  in  relation  Domains: 


Domains 

T 

OnewO 

iOjO 

1  2 

T 

Onewd 

iO  jO 

1  3 

T 

Onewl 

iOjO 

21 

T 

Onewl 

iOjO 

31 

T 

Onew2 

iOjO 

33 

T 

Onew3 

iOjO 

22 

T 

Onew3 

iO  jO 

23 

T 

Onew3 

iOjO 

32 

T 

Onew3 

iOjd 

33 

T 

Onew4 

iOjO 

1  1 

T 

Onew4 

iO  jO 

1  2 

T 

Onew4 

iOjO 

1  2 

T 

Onew5 

iO  jO 

2  1 

T 

Onew5 

iO  jO 

3  1 

T 

OjiO 

iO  jO 

1  1 

T 

OjiO 

iOjO 

1  2 

T 

OjiO 

iO  jO 

21 

T 

mSEM 

22 

T 

■Ol 

1  1 

Cond 

D 

Iname 

/  1 

T 

OijO 

iO  jO 

1  2 

T 

OijO 

iOjO 

1  3 

T 

OijO 

iOjO 

21 

T 

OijO 

iOjO 

22 

T 

OijO 

iOjO 

23 

T 

OijO 

iOjO 

31 

T 

OijO 

iOjO 

32 

T 

OijO 

iOjO 

33 

T 

OiO 

iO 

1 

T 

OiO 

iO 

2 

T 

OiO 

iO 

3 

T 

OjO 

jo 

1 

T 

OjO 

jo 

2 

T 

OjO 

jO 

3 

T 

OiiO 

iO 

1 

T 

OiiO 

iO 

2 

T 

OjjO 

jo 

1 

T 

OjjO 

jo 

2 

T 

CiOO 

iOC 

1 

T 

OiOO 

2 

T 

OiOO 

iOO 

3 

Besides  declaration  of  the  following  domains  are  deleted  from  the  text  of  part  One: 

OnewO:((IO=1):GO=2..3)).  Onewl:(00=l):(i0=2..3)).  Onew2:((iO=3);aO=3)). 
Onew3:((i0=2..3);00=2..3)).  Onew4:((iO=l);00=1..3)).  Onew5:((jO=1):(IO=1..3)). 

O|l0:(  00=1.  2):  i0=1..2  ). 

OiiO;(  (i0=1..3):  00=1-3)). 

OiO:(iO=1..3).  Oj0:00=1..3). 

OfiO:00=l-2).  OiIO:(iO=1..2). 

and  the  declaration  of  the  domain  given  below  is  deleted  from  the  text  of  part  Two: 

OiOO:(iOO=1..3). 

The  result  of  performing  functions  V'AR-DECLARATION(«)  and  REBUILD* 
INDEX(y),(>=One,Two  -  modification  of  prograrn  and  filling  in  relation  Variables: 

MAIN  PART  One. 

BEGIN 

OjIYESO  ,  OjiNOTO  :  OjiO/  TsO(IO.jO)  <  0.5. 

INPUT  a0(i0,j0)(FILE=’dcita.dat')  ON  OijO,  b000){FILE=’data.dcit')  ON  OjO. 

INPUT  T0(i0,j0)  ON  OnewO,  TO(iOjO)  ON  Onewl. 

OUTPUT  T0(i0,j0)(FILE='results’)  ON  Onew2. 

OUTPUT  x0(i0)(FILE=’resolts’)  ON  OlO. 

FOR  OlO  ASSUME  x0(l0)  =  yO(IO)  +  SUM({Oi0)a0(l0,i0)*b0a0)). 

FOR  OlO  ASSUME  y0{l0)  =  b0{l0). 

FOR  Onew3  ASSUME  T0(I0,|0)  =  T0(I0-1  J0)+T(l0,j0-1). 

FOR  Onew3  ASSUME  Ts0{l0,j0)  =  SIN(T0(l0,i0)). 
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FOR  Onew4  ASSUME  Ts0(10.j0)  =  SIN(iO+jO). 

FOR  OnewS  ASSUME  Ts0(i0.j0)  =  SIN(iO-jO). 

FOROiO  ASSUME 

COMPUTE  Two(TsO  ON  OliO/jO=IO  RESULT  TnO  ON  OliO/iO=IO). 
OUTPUT  TnO(iO,iO)(FllE=’resutts')  ON  OIjO,  CondOfiOjO)  ON  OjlYESO. 
FOR  OjIYESO  ASSUME  CondO(IO,jO)  =  TnOOOJO). 

FOR  OjlNOTO  ASSUME  Cond0(10 JO)  *  TsO(IO  jO). 

END  PART. 

PART  Two.  TsOO  RESULT  TnOO 
BEGIN 

INPUT  EpsOO.  INPUT  TOO(IOO)(FILE=’data.deit’)  ON  OiOO. 

ITERATION  TnOO  ON  NOO. 

INITIAL  NOO  8  0  : 

FOR  OiOO  ASSUME  Tn00(i00,N00}  =  TsOO(iOO). 

END  INITIAL 

FOR  OiOO  ASSUME  Tn00(l00,N00)  =  TnOOpOO.NOO-ll/S. 

EXIT  WHEN  ABS(Tn00[3,N001)  <  EpsOO. 

END  ITERATION  NOO. 

END  PART. 


Variables 

D 

X 

Iter 

OijO 

aO 

A 

OijO 

TO 

A 

OijO 

TsO 

A 

OiiO 

TnO 

A 

OiO 

xO 

A 

OiO 

yo  .. 

A 

OiiO 

CondO 

A 

OiO 

bO 

A 

OiOO 

TsOO 

A 

OiOO 

TnOO 

NOO 

OiOO 

TOO 

A 

0 

EpsO 

A 

The  result  of  performing  function  COND-DOMAIN-EVALUATE(o),o=One .Two  is  adding 
tuples  to  relation  Domains-. 


Domains 

Cond 

D 

Iname 

I 

(Ts0(1,1)<0.5) 

OiiYESO 

iO  jO 

1 1 

(Ts0(1.2)<0.5) 

OiiYESO 

iO  iO 

1  2 

(Ts0(2,1)<0.5) 

OiiYESO 

iOiO 

21 

(Ts0(2.2)<0.5) 

OiiYESO 

iO  iO 

22 

-i(Ts0(1,1)<0.5) 

OiiNOTO 

iO  jO 

1  1 

-,{Ts0(1,2)<0.5) 

OiiNOfo 

iO  iO 

12 

-,(Ts0(2,1)<0.5) 

OiiNOTO 

iOjO 

21 

-,rrs0(2,2)<0.5) 

OiiNOTO^ 

iO  iO 

22 

OjiYesO  ,  OjiNOTO  :  OjiO/TsO(iO,jO)  <  0.5. 
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The  result  of  performing  function  INPirT"DECLARAnON(o),  OUTPUT- 
DECLARAT10N(o),o=One,Two  -  modification  of  the  program  and  filling  in  relations Output, 
Relations. 

MAIN  PART  One. 

BEGIN 

FOR  OIO  ASSUME  xOpO)  =  yO(IO)  +  SUM((OiO)aO(iO,jO)*bOOO)). 

FOR  OiO  ASSUME  yO(iO)  =  bO(iO). 

FOR  OnewS  ASSUME  TO(fO,jO)  =  TO(iO-l.jO)+T(iOJO-l). 

FOR  Onew3  ASSUME  TsO(IO,iO)  *  SINaO(IO,iO)). 

FOR  Onew4  ASSUME  TsO(iO,iO)  =  SIN(IO+]0). 

FOR  OnewS  ASSUME  TsO(iO,jO)  -  SIN(iO-JO). 

FOR  OiO  ASSUME 

COMPUTE  TwoasO  ON  0110/10=10  RESULT  TnO  ON  OljO/IO=IO). 

FOR  OjiYESO  ASSUME  CondO(iO.jO)  =  TnO(iOJO). 

FOR  OjlNOTO  ASSUME  CondO(IO.jO)  =  TsO(IO,jO). 

END  PART. 

PART  Two.  TsOO  RESULT  TnOO 
BEGIN 

ITERATION  TnOO  ON  NOO. 

INITIAL  NOO  =  0 : 

FOR  OiOO  ASSUME  Tn00(i00,N00)  =  TsOO(IOO). 

END  INITIAL 

FOR  OIOO  ASSUME  Tn00(i00,N00)  =  TnOOIiOO.NOO-ll/S. 

EXIT  WHEN  ABS(Tn00I3,N00])  <  EpsOO. 

END  ITERATION  NOO. 

END  PART. 


Output 

Cond 

Var 

Value 

File 

T 

T0(3,3) 

A 

‘rezults’ 

T 

x0(1) 

A 

‘rezults’ 

T 

x0(2) 

A 

‘rezults’ 

T 

x0(3) 

A 

‘rezults’ 

T 

Tn0(1,1) 

A 

‘rezults’ 

T 

Tn0(1,2) 

A 

■rezults’ 

T 

A 

‘rezults’ 

T 

Tn0(2.1) 

A 

‘rezults’ 

T 

Tn0(2,2) 

A 

‘rezults’ 

T 

Tn0(2,3) 

A 

‘rezults’ 

T 

Tn0(3,1) 

A 

‘rezults’ 

T 

Tn0(3,2) 

A 

‘rezults’ 

T 

Tn0(3,3) 

A 

‘rezults’ 

(Ts0(1,1)<0.5) 

Cond0(1,1) 

A 

‘rezults’  " 

(Ts0(1 ,2)<0.5) 

bbhi 

‘rezults’ 

Cond0(2,1) 

A 

‘rezults’ 

Cond0(2,2) 

A 

‘rezults’ 

Cond0(1,1) 

A 

‘rezults’ 

IHHI 

‘rezults’ 

Cond0(2,1) 

‘rezults’ 

-,(Ts0{2,2)<0.5) 

‘rezults’ 

A 


Input 

Var 

Value  File 

KEBBII 

A 

‘data.dat’ 

A 

‘data.dat’ 

A 

‘data.dat’ 

A 

‘data.dat’ 

a0(2,2) 

A 

‘data.dat’ 

A 

‘data.dat’ 

a0(3,1) 

A 

‘data.dat’ 

A 

‘data.dat’ 

a0(3.3) 

.\ 

‘data.dat’ 

b0(1) 

.V 

‘data.dat’ 

b0(2) 

A 

‘data.dat’ 

b0(3)  I 

A 

‘data.dat’ 

T0(1,2) 

A 

‘norma.dat’ 

T0(1 ,3) 

A 

‘notma.dat’ 

T0(2,1) 

A 

‘norma.dat’ 

T0(3.1) 

A 

‘norma.dat’ 

A 

‘norma.dat’ 

TOOd) 

A 

‘data.dat’ 

100(2) 

A 

’data.dat’ 

T00(3) 

‘data.dat’ 
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Relations 

Name  ItVal  Def  Cond 

Var 

Value 

Func 

One 

0  0 

T 

T 

aOd.D 

A 

INPUT 

One 

0  0 

T 

T 

a0(1,2) 

A 

INPUT- 

One 

0  0 

T 

T 

a0(1,3) 

A 

INPUT 

One 

0  0 

T 

T 

a0(2,1) 

A 

INPUl- 

One 

0  0 

T 

T 

a0(2.2) 

A 

INPUT 

One 

0  0 

T 

T 

a0(2.3) 

A 

INPUT 

One 

0  0 

T 

T 

a0(3,1) 

A 

INPUT 

One 

0  0 

T 

T 

a0(3.2) 

A 

INPUT 

One 

0  0 

T 

T 

a0(3,3) 

A 

INPUT 

One 

0  0 

T 

T 

b0(1) 

A 

INPUT 

One 

0  0 

T 

T 

b0(2) 

A 

INPUT 

One 

0  0 

T 

T 

b0(3)  ^ 

A 

INPUT 

One 

0  0 

T 

T 

T0(1.2) 

A 

INPUT 

One 

0  0 

T 

T 

T0(1,3iZ 

A  ' 

INPUT 

One 

0  0 

T 

T 

T0(2.1) 

A 

INPUT 

One 

0  0 

T 

T 

T0(3.1) 

A 

INPUT 

Two 

0  0 

T 

T 

EpsOO 

A 

INPUT- 

Two 

0  0 

T 

T 

TOOd) 

A 

INPUT 

Two 

0  0 

T 

T 

T00f2^ 

A 

INPUT 

Two 

0  0 

T 

T 

T00(3) 

A 

INPUT 

The  result  of  performing  function  INFUT-DATA-PROCESSINQ  {Files)  is  filling  in  relation 
InputData: 


Var 

Value 

File 

bOd) 

7.0 

‘data.dat’ 

bOd) 

9.0 

‘data.dat’ 

TOd  ,2) 

1.1 

‘norma.dat’ 

T0d,2) 

2.2 

‘norma.dat’ 

TOd  ,2) 

3.3 

‘norma.dat’ 

TOd  .2) 

4.4 

‘norma.dat’ 

EpsOO 

1.0E-6 

‘norma.dat’ 

TOOd) 

-1.1 

‘norma.dat’ 

TOOd) 

-1.1 

‘norma.dat’ 

TOOd) 

-1.1 

‘norma.dat’ 

Input  Data 

Var 

Value 

File 

aOd.D 

2.0 

‘data.dat’ 

a0d,2) 

3.0 

‘data.dat’ 

a0d,3) 

-4.0 

‘data.dat’ 

a0(2,1) 

3.0 

‘data.dat’ 

a0(2,2)  1 

4.0 

‘data.dat’ 

a0(2,3) 

-1.0 

‘data.dat’ 

a0(3,1) 

5.0  1 

‘data.dat’ 

a0(3,2) 

5.0 

‘data.dat’ 

a0(3.'3) 

5.0 

‘data.dat’ 

bOd) 

5.0 

‘data.dat’ 

The  result  of  performing  function  CHECK-INPin'-DATA(P)  -  filling  in  the  values  of  attribute 
I'a/ue  in  relation  Input: 
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Var 

Value 

File 

7.0 

‘data.dat’ 

mmsm 

CO 

b 

‘data.daf 

T0(1,2) 

1.1 

‘norma.dat’ 

2.2 

‘norma.dat’ 

T0(1.2) 

3.3 

‘norma.dat’ 

■Eim 

4.4 

‘norma.dat’ 

1.0E-6  1 

‘norma.dat’ 

KESni 

‘norma.dat’ 

■Esim 

-1.1 

‘norma.dat’ 

■EQSni 

-1.1 

‘norma.dat’ 

Input 

Var 

Value 

File 

aO(l.l) 

2.0 

‘data.dat’ 

a0(1,2) 

3.0 

‘data.dat’ 

a0(1,3) 

-4.0 

‘data.dat’ 

a0(2.1) 

3.0 

‘data.dat’ 

a0(2,2) 

4.0 

‘data.dat’ 

a0(2,3) 

-1.0 

‘data.dat’ 

a0(3.1) 

5.0 

‘data.dat’ 

a0(3,2) 

5.0 

‘data.dat’ 

a0(3,3) 

5.0 

‘data.dat’ 

b0(1)  - 

5.0 

‘data.dat’ 

The  result  of  performing  function  CALLrQRAPH-CREATE(P)  is  designing  the  graph  of  functions 
and  parts'  cails  G=(F^,  V  =  {OneJwo(TsOORESUinnOO)},  V  =  (Onejwo).  The  result  of 
performing  function  CHECK-CALL-€LRAFH(P)  -  T. 

The  result  of  performing  function  OPERATORS“TO-RELAnONS(o)  -  addition  to  relation 
Relations  (i/r’-unique  names  obtained  in  the  result  of  using  function  uname{P)): 


Two(TsO  ON  OijO/jO=1 
REZULT 

TnO  ON  OijO/iO=1) 
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One 

0  0 

Def(TsO(1,2), 

Ts0(2,2), 

Ts0(3,2)) 

T 

Tn0(2,1). 

Tn0(2.2), 

Tn0(2,3) 

g 

Two(TsO  ON  OijO/jO=2 
REZULT 

TnO  ON  OijO/iO=2) 

One 

0  0 

Def(TsO(1,3), 

T 

Tn0(3,1). 

Two(TsO  ON  OijO/jO=3 

Ts0(2.3). 

Tn0(3.2), 

■■ 

REZULT 

Ts0(3,3)) 

Tn0(3.3) 

HI 

TnO  ON  OijO/iO=3) 

One 

0  0 

Ts0(1,1)<0.5 

Cond0(1,1) 

H 

Tn0(1,1) 

One 

0  0 

Ts0(l,2)<0.5 

Cond0(1,2) 

BH 

Tn0(1,2) 

One 

0  0 

Ts0(2,1)<0.5 

Cond0(2,1) 

A 

Tn0(2.1) 

One 

0  0 

Ts0(2,2)<0.5 

Cond0(2,2) 

Tn0(2,2) 

One 

0  0 

Def(Ts0(1,1)) 

A 

Ts0(1,1) 

One 

0  0 

Def(Ts0(1,1)) 

Cond0(1,2) 

A 

Ts0(1,1) 

One 

0  0 

Def(Ts0(1,1)) 

Cond0(2,1) 

A 

Ts0(1,1) 

One 

0  0 

Def(Ts0(1,1)) 

A 

Ts0(1,1) 

Two 

NOOO 

Def(Ts00(1)) 

T 

wasMssm 

A 

TsOOd) 

Two 

NOOO 

Def(Ts00(2)) 

T 

A 

Two 

NOOO 

Def(Ts00(3)) 

T 

Tn00(3.N00) 

A 

Two 

N00  1 

Def(Tn00(1,N00-1)) 

T 

TnOOd.NOO) 

A 

Tn00d.N00-1)/3 

Two 

NOO  1 

Def(Tn00(2,N00-1)) 

T 

Tn00(2,N00) 

A 

Tn00(2.N00-1V3 

Two 

NOO  1 

Def(Tn00(3,N00-1)) 

T 

Tn00(3.N00) 

A 

Tn00(3.N00-1)/3 

Two 

NOO  1 

Def(Tn00(3,N00). 

ABS(Tn00(3.N00)) 

0 

0 

0 

EpsOO) 

<  EpsOO 

Design  of  abstract  program  A(P)  is  finally  finished.  Now  here  is  a  brief  description  of  AA-I- 
machine  working  by  this  program  assuming  that  NonDeterm—ArgDefimd  and  NonDetermOnt  ~ 
OutDefined.  Thus  to  eliminate  non-determination  of  y4M-machine  and  not  to  specify  all  the  variants 
of  possible  computations  we  have  to  demonstrate  semantics  of  naturalOdeal)  parallelism. 
Initialization. 

Checking  CHECKHR.EASSI€iNMENT(Re/af/ons)  gives  T  after  that  function  CALL- 

PART(One)  is  performed  invoking  tuples  ^(Relations,  Name  =  One). 

Stage  of  interpretation  1. 

a)  Relation  NonDeterm=ArgDeflned  determines  computation  variables  a0(1,1),a0(1,2), 
a0(1,3),  a0(2.1),  a0(2,2).  a0(2,3),  a0(3,1),  a0(3,2).  a0(3.3),  b0(1),  b0(2),  b0(3).  T0(1,2),  T0{1,3), 
T0(2,1),  T0(3,1),  Ts0(1,1),  Ts0(1.2),  Ts0(1,3).  Ts0(2,1),  Ts0(3,1).  The  values  of  these  variables  are 
computed  corresponding  to  function  fimc  set  in  the  field  of  attribute  Func. 

b)  CHECK-STOP  =  nextAMstep 
Stage  of  interpretation  2. 

a)  Relation  ArgDefmed  determines  the  computation  of  variables  u1,  u2,  u3,  y0(1),  y0(2), 
y0(3),  T0(2,2),  {Tn0(1,1).  Tn0(1,2),  Tn0(1,3)}.  The  values  of  these  variables  are  computed. 
Computation  of  ^n0(1 ,1),  Tn0(1 ,2),  Tn0(1 ,3)}  engenders  call  of  function 

CALL-PART(Two(TsO  on  OijO/jO=1  RESULT  TnO  ON  OijO/iO=1 )) 
and  invoking  the  tuples  of  part  Two. 

An  extract  of  relation  Relations  is  -obtained  as  the  result  of  invoking  the  tuples  of  part  Two. 
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Name 

wsm\ 

Def 

Cofid 

^■^^■11 

Two 

0  0 

T 

T 

EdsOO 

■iMraH 

Two 

0  0 

T 

■■■■■11 

■BhI 

BHBI 

■TMogi 

Two 

0  0 

T 

■■■Ml 

■SS 

Two 

0  0 

T 

■■■■■11 

■BH 

Two 

0  0 

Def(TsO(1.1)) 

T 

u4 

Two 

0  0 

T 

u5 

HHI 

Two 

0  0 

Def(TsO(3,1)) 

T 

u6 

A 

■mni 

Two 

0  0 

Def(u4) 

T 

TsOOd) 

A 

u4 

Two 

0  0 

Def(u5) 

T 

Ts00(2) 

A 

u5 

Two 

0  0 

Def(u6) 

T 

A 

Two 

NOOO 

T 

TnOOd.O) 

A 

Two 

Nood 

T 

Tn00(2.0) 

A 

HESSIilSlH 

Two 

NOOO 

T  _ 

Tn00(3.0)  1 

A 

Ts00(3) 

Two 

N00  1 

DefanOO(I.NOO-l)) 

T 

A 

Tn00d,0)/3 

Two 

NOO  1 

T 

Two 

NOO  1 

Def{Tn00(3,N00-1)) 

T 
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b)  CHECK-STOP  =nextAMstep 

Stage  of  interpretation  3  Tn/o<»\ 

a)  relation  ArgDefined  determines  the  computation  of  variables  x0(1),  x0(2),  x0(3),  T0(2,3), 
T0(3,2).  Ts0(2,2).  u4,  u5,  u6.  EpsOO.  T00(1).  T00(2).  T00(3).  The  values  of  these  variables  are 

computed.  OutDefwed  determines  variables  x0(1)._  x0(2).  x0(3).which  values  has  been 

computed.  Output  of  these  values  is  carried  out  into  file  ‘results’. 

c)  CHECK-STOP  =nextAMstep 
Stage  of  interpretation  4. 

a)  Relation  /IrgDe/rwt/ determines  the  computation  of  variables  T0(3,3),  Ts0(2,3).  TsO(3,Z) 
u4  u5.  u6,  Ts00(1),  Ts00(2),  Ts00(3).  The  values  of  these  variables  are  computed. 

b)  Relation  OutDefmed  determines  variable  T0(3,3)  which  value  has  been  computed.  Outpu 
of  these  values  is  carried  out  into  file  ‘results’. 

c)  CHECK-STOP  ^mxtAA'lstep 
Stage  of  interpretation  5. 

a)  Relation  ArgDefmed  determines  the  computation  of  Ts0(3,3),  {Tn0(2,1),  TnO(  ,  ), 
Tn0(2  3)}.  Tn00(1,0),  Tn00(2,0).  Tn00(3,0).  The  values  of  these  variables  are  computed.  Computation 
of  {TiioCIJ),  Tn0(1 ,2),  Tn0(1 ,3)}  engenders  call  of  function 

CALL-PART(Two(TsO  on  OijO/jO=2  RESULT  TnO  ON  OijO/iO=2)) 
and  invoking  the  tuples  of  part  Two  (this  process  and  the  computations  determined  by  it  aren’t 
described  to  make  the  text  shorter). 

c)  CHECK-STOP  =nextAMstep 
Stage  of  interpretation  6. 

a)  Relation  ArgDefmed  determines  the  computation  Tn00(1,1),  Tn00(2,1).  Tn00(3,i), 
{Tn0(3,1),  Tnd(3,2),  Tn0(3,3)}.  The  values  of  these  variables  are  computed.  Computation  of 
{TnOfs’.l)!  Tn0(3.2),  Tn0(3.3)}.  engenders  call  of  function 
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CALL-PART(Two(TsO  on  OijO/jO=3  RESULT  TnO  ON  OijO/iO=3)) 
and  invoking  the  tuples  of  part  Two  (this  process  and  the  computations  determined  by  it  aren’t 
described  to  make  the  text  shorter). 

Exit  condition  of  iteration  ABS(Tn00(3,l))<Eps00  is  checked  and  either  the  next  step  to 
compute  Tn00(1,1),  Tn00(2,1),  Tn00(3.1)  is  done  or  passing  the  result  is  performed.  Let's  assume 
that  iteration  is  finished. 

c)  CHECK-STOF  =mxtAMstep 
Stage  of  interpretation  7. 

a)  Relation  ArgDefmed  determines  the  computation  of  u7,  u8,  u9.  The  values  of  these 
variables  are  computed. 

c)  CHECK-STOP  -nextAMstep 
Stage  of  interpretation  8. 

a)  Relation  ArgDefined  ieXemmes  the  computation  of  Tn0(1,1),  Tn0,(1,2),  Tn0(1,3).  The 
values  of  these  variables  are  computed. 

b)  Relation  OutDefined  determines  variables  Tn0(1 ,1),  Tn0(1 ,2),  Tn0(1 ,3)  which  values  have 
been  computed.  Output  of  these  values  is  carried  out  into  file  ‘results’. 

c)  CHECK-STOP  -nextAMstep 
Stage  of  interpretation  9. 

a)  Relation  ArgDefined  determines  the  computation  of 

Cond0{1,1)  (ifTs0(1,1)<0.5), 

Cond0(1,2)  (ifTs0(1,2)<0.5). 

Cond0(1,1)  (if  -.(Ts0(1.1)<0.5)), 

Cond0(1,2)  (if  -i(Ts0(1,2)<0.5)). 

The  tuples  determining  the  computation  of  Cond0(1 ,1)  (if  -i(TsO(1 ,1)<0.5)), 

Cond0(1,2)  (if  -i(Ts0(1,2)<0.5))  are  eliminated  from 

ArgDefined  and  ARelations  as  the  condition  Cond=¥. 

The  values  of  the  variables  Cond0(1,1)  (if  Ts0(1,1)<0.5), 

Cond0(1,2)  (ifTs0(1,2)<0.5)  are  computed. 

b)  Relation  OutDefijied  6ef\r\es  variables  Cond0(1,1),  Cond0(1,2)  which  values  have  been 
computed.  The  tuples  describing  output  Cond0(1 ,1)  (if->(TsO(1 ,1)<0.5)),  Cond0(1 ,2) 

(if  -i(Ts0(1,2)<0.5))  are  eliminated  from  OutDefined  as  condition  Cond=¥.  Output  of  the  values 
Cond0(1,1)  (if  Ts0(1,1)<0.5),  Cond0(1,2)  (ifTsO(1 ,2)<0.5)  is  carried  out  into  file  “results”. 

c)  CHECK-STOP  ^nextAMstep 
Stage  of  interpretation ... 

CHECK-STOP  =nextAMstep 

2.10  Definite  work  of  .4M-machine 

The  value  of  variable  X  eARelationsiVar)  are  computable  if  it  has  no  information 
dependencies  or  the  values  of  all  variables  T,,...,y„,from  which  has  information  dependencies 
have  been  computed. 

COMPUTABLE(^  ~ 

3a(a  e  ARelations  a  a  =  {tiame  o  itval  °  def  o  cond  o  var  o  value  <>  fitnc) 

aA”  =  var  A  IDQ-E?PRi/[^)  =  T  a  IDQ-EJPB^cowi/)  =  T  a  value  ^  A) 

(In  this  definition  function  LOe-EXPR(t?)  computes  the  value  of  logical  expression  e). 

Computation  of  function  fuTic  =  ^(ARelations,  Var  =  X)(Func)  if  OOMPUiaaE(Ar)  s  T 

is  called  computation nf  tho  value  of  variable  X  e  ARelationsfVar ) . 

CO!»K*DTATlON(;!0  ~3a(a  e  ARelations /\  Xccola  OCaiPUiaBE(Ar)) 
OaiMFUE(a)) 
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Variable  X  is  determined  definitely  if  program  P  satisfies  the  condition  of  singie-assignment 
and  X  e  ARelatiom{Var) . 

DEF-ONE-T(H)NE(J!Q  ~  CHECKHElEASS16NMENT(/?e/a/io«s) 

aX  e  ARelations(yar) 

Program  P  is  correct  if  it  is  semantically  correct  and  the  work  of  /4M-machine  is  terminated. 
Theorem.  All  the  values  of  the  variables  are  determined  definitely  for  correct  prog^m  P. 

Proof. 

Work  of  ylAZ-machine  finishes  in  state  stop  for  correct  program  so  the  numter  of 
interpretation  stages  is  finite  and  according  to  definition  of  state  stop  the  values  of  all  the  variables 
which  are  to  be  computed  are  computed. 

The  condition  of  single-assignment  is  fulfilled  for  correct  program  as  function  CHECK- 
REASSIGNMENT  is  done  by  AM-madtme.  Thus  ail  the  variables  of  program  P  are  determined 
definitely  in  accordance  with  the  definition  of  definite  determination. 

The  arbitrariness  of  choosing  set  NonDeterm  c  ArgDeflned  at  each  stage  of 

interpretation  and  computation  of  the  variables’  values  from  Nondeterm  don’t  breake  definite 
determination  of  the  variables. 

2.11  Notes  to  semantics’  specification 

1)  The  specification  of  the  NORMA  language’s  semantics  given  above  is  a  shortened 

specification.  Thus  some  functions  of  low  level  (e.g.  HEAD(/),  TAIL(/),  /-  list)  and  also  the  number 
of  functions  fixing  semantic  errors  aren’t  defined.  It  is  done  to  make  the  text  of  report  shorter. 
Descriptions  of  functions  fixing  semantic  errors  are  to  be  given  in  working  out  of  NORMA  translator’s 

realisation .  . 

2)  The  specification  of  the  NORMA  semantics  isn’t  to  be  considered  as  the  complete  one. 
Some  more  details  and  modifications  are  possible  to  be  added  in  the  process  of  further  researching.  It 
may  specify  the  rules  of  the  NORMA  language  interpretation.  Particularly  the  methods  of  specifying 
computational  environment  where  Norma  program  can  be  carried  out  are  under  consideration  now. 
The  result  of  this  research  may  give  more  details  to  the  NORMA  language. 

3)  Definition  of  an  abstract  machine  AM-madcme  is  a  theoretical  model  which  is  the  base  for 
working  out  algorithms  of  Norma  program  translation.  This  problem  isn’t  easy  as  the  algorithms  must 
be  effective  by  time  and  memory,  consider  the  peculiarities  of  the  target  computer  and  be  equivalent 
to  the  notion  of  y4A/-machine.  Obtained  theoretical  results  (e.g.  [3])  and  worked  out  experimental 
version  of  the  NORMA  language’s  translator  allow  to  consider  this  problem  practically  solvable. 

4)  Approach  used  in  the  semantics’  specification  is  based  on  the  representation  of  information 
dependencies  between  the  variables  from  NORMA  program  by  means  of  relations  from  relational 
algebra  and  the  rules  of  interpreting  these  relations.  This  approach  may  be  used  for  the  specification 
of  the  semantics  of  the  languages  similar  to  NORMA  which  may  be  used  for  solving  problerris  in  other 
application  domains,  e.g.  in  the  domain  of  relational  data  bases.  To  specify  the  semantics  of  the 
languages  of  considered  type  one  may  use  other  approaches  based  on  the  data  driven  computation 
model,  e.g.  data  flow  model. 
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Abstract. 

The  NORMA  language  is  a  tool  aimed  at  automatic  solution  of  the  mathematical  physics  problems  on 
parallel  computer  systems. 

The  aim  of  the  NORMA  language  is  to  eliminate  the  programming  phase  which  is  necessary  to  pass 
from  computational  formulae  derived  by  an  application  specialist  to  a  computer  program.  There  is  no 
essential  difference  between  computational  formulae  and  NORMA  program  structures  -  these  formulae  are 
an  input  for  the  NORMA  translating  system. 

In  fact  NORMA  program  is  a  nonprocedural  specification  of  problems  to  be  solved.  The  mathematical 
problems  connected  with  the  synthesis  of  output  program  are  solvable  in  the  case  of  the  NORMA  language. 

Draft  specification  of  the  NORMA  language  is  given. 
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Introduction. 

NORMA  is  a  declarative  language  aimed  at  computation  tasks*  specifications.  Translator  of  the 
language  has  besides  traditional  functions  of  semantic  and  syntax  analysis  a  function  of  output  program 


synthesis  during  the  translation  .In  other  words  the  mode  and  the  order  of  computations  depended  on  target 
computer  language  and  architecture  are  arranged. 

The  principal  ideas  of  automated  program  design  based  on  task’s  specification  were  formulated  by  I.B. 
Zadykhailo  [1]  even  in  1963.  Their  further  development  gave  birth  to  NORMA  and  several  versions  of  the 
translator  [2*14], Some  examples  of  NORMA  practical  usage  are  described  in  [15,16]. 

Authors  consider  the  approach  used  in  Norma  design  and  realisation  useful  for  creation  of  new  generation 
languages.  The  development  of  parallel  computers  enhances  the  value  of  this  method. 

This  paper  describes  syntax  and  semantics  of  the  NORMA  language. 

1.  Technical  background. 

The  idea  of  the  NORMA  language  was  produced  by  applied  mathematicians  from  KIAM  (Keldysh 
Institute  of  Applied  mathematics)  RAS  (Russian  Academy  of  Science).It  was  an  attempt  of  automated 
program  design  based  on  the  jobs  (  application  of  numerical  methods  to  physical  problems  solution) 
prepared  for  further  programming. 

It  was  1960  when  the  first  works  on  this  theme  appeared  in  our  institute  (e.g.,[17]).The  method  of 
writing  jobs  in  this  work  was  called  “parameter  record”.  Later  on  such  methods  had  the  name  “non-procedure 
specifications”  and  afterwards  they  were  called  “declarative  specifications”.  The  languages  used  these 
mcihods  are  of  specification  type  where  the  rules  of  data  computations  must  be  defined  but  llie  order  of 
computations  may  be  arbitrary  (e.g.  in  parallel  or  sequential  mode)  according  to  the  rules. 

Many  researches  on  programming  theory  and  practical  results  proved  the  correctness  of  the  chosen 
path.  Some  corroborating  aspects  are  given  below. 


2.  Aims. 

Practical  value  of  application  software  design  depends  on  the  degree  of: 

1 )  automation  of  design  process, 

2)  portability  and  reliability  of  the  output  program, 

3) convenience  for  the  specialist  in  particular  application  domain. 

The  probieins  of  providing  all  these  items  has  been  solving  for  the  whole  period  of  computer  usage 
but  there  have  been  no  universal  and  perfect  methods  for  their  solution  yet.  It  can  be  explained  by 
complicity  of  these  problem  and  ver>'  fast  development  of  the  programming  as  a  branch  of  science  (new 
application  domains  and  new  hardware  appear  very'  quickly). 

Progress  in  computer  technology  and  architecture  provides  ample  computational  potentialities.  But 
taking  into  consideration  [1-3]  requirements  their  perfect  practical  application  has  many  difficulties.  Thus 
when  multiprocessor  computers  appeared  the  question  of  new  application  software  design  was  raised. 


Probably  application  programmer  wants  to  know  as  little  as  possible  about  the  architecture  of  a 
particular  computer  but  to  use  its  resources  as  much  as  he  could.  He'd  like  to  have  convenient  tools  for  test 
and  debugging  but  no  problems  with  programs’  portability.  To  work  using  generic  methods  and  terms  is 
also  very  important  for  application  programmer.  He  needn’t  know  questions  which  are  alien  to  him 
[compiler’s  peculiarities,  programming  languages,  system  library  organization  and  the  questions  of  its 
effective  usage,  etc.]. 

The  NORMA  language  and  the  approach  to  application  program  design  is  an  attempt  to  take  into 
consideration  [1-3]  requirements.  The  authors  of  this  approach  realize  the  difficulties  of  the  fundamental  task 
solution  and  the  fact  that  universal  and  perfect  way  to  this  solution  couldn’t  be  found  immediately  Though 
we  can  point  out  some  goals  of  these  approach  even  now. 

3.  The  goals  of  NORMA  usage. 


High  level  of  automation  of  the  application  design  process. 

Initial  equations  are  formalized  only  to  make  input  from  keyboard  easy.  It  is  important  that  the 
issue  of  the  formulae  isn’t  changed.  Thus  the  programming  phase  is  skipped  and  further  design  of  the 
program  is  carried  out  automatically 

Reliability  and  portability  of  the  programs. 

If  the  method  of  the  task  solution  is  right  and  the  formulae  are  written  correctly  an  output  program 
will  be  also  correct  (  because  the  process  of  programming  is  automatic). 

We  have  to  underline  that  the  strict  specification  of  the  task  solution  process  on  the  application 
domain  level  is  very  important.  This  level  is  the  most  reliable  because  it  deals  only  with  the  scheme  of  the 
computations  and  doesn't  depend  on  the  optimizations  which  can  be  realized  during  programming  phase.  It 
is  evident  that  if  we  propose  the  language  for  the  strict  specification  of  the  task  solution  process  in  the 
terms  of  application  domain  automatically  designed  program  based  on  this  specification  will  have  the  degree 
of  reliability  corresponding  to  the  one  of  the  translator  and  the  translator's  optimization  level. 

NORMA  used  only  the  terms  of  application  domain  associated  with  the  grid-based  mathematical 
physics  tasks  solutions.  Simplicity  and  reliability  of  the  programs  arc  provided  by  eliminating  of  the 
programming  phase  and  required  only  the  knowledge  of  application  domain  terms  and  their  correct  usage. 

Synthesising  translator  from  NORMA  allows  for  computer  architecture  peculiarities.  It  provides 
portability  of  the  application  programs. 


Method  of  parallel  program  design. 


Generic  mathematical  formulae  usage  in  NORMA  has  great  possibilities  for  problem  optimization 
in  all  cases  including  realization  on  parallel  systems. 

The  specification  of  the  task  solution  keeps  its  natural  parallelism.  This  specification  doesn’t 
contain  any  adaptation  constraints  to  computer  architecture,  programming  language  peculiarities  or  other 
requirements.  It  is  “clear”  specification  of  task  solution  which  restricts  only  computation  order  according  to 
the  computation  scheme  content  (informatic  dependencies  between  variables).There  is  no  term  “memory” 
for  storing  different  values  in  different  periods  of  time.  It  simplifies  natural  parallelism  revealing 
algorithms  design  and  creation  of  the  target  program. 

In  the  cases  where  computation  order  is  important  ( e.g. .  it  affects  the  precision  of  computations, 
rate  of  convergence,  etc.)  supplementary  notions  may  be  used  ( they  are  interpreted  in  a  special  way). 

The  development  of  the  language  makes  its  life  longer.  NORMA  is  rather  young  and  capable. 


Norma  language  specification. 


1.  Application  domain  and  main  characteristics. 

The  NORMA  language  is  a  specialized  language  applied  to  numerical  -  based  specification  of  the 
mathematical  physics  problems  solutions.  First  it  was  directed  towards  solving  mathematical  physics  ‘ 
problems  by  means  of  grid  difference  method  solutions.  But  later  practice  shows  us  that  NORMA 
application  domain  could  be  more  extensive. 

Originally  NORMA  could  be  deciphered  as  :  Non-procedure  Specification  of  the  Difference 
Algorithm  Models.  Nowadays  we  decode  this  abbreviation  as  NORMAl  level  of  computer  -  mathematician 
communication.  Mathematician’s  formulae  are  input  almost  directly  into  the  computer  system. 

NORMA  requires  no  information  about  computation  order  and  ways  of  computation  process 
organization.  The  order  of  the  sentences  may  be  arbitrary  (  informatic  connections  are  revealed^during  the 
translation). 

Value  can  be  assigned  to  any  variable  in  NORMA  only  once.  This  feature  characterizes  the  level  of 
the  NORMA  language.  There  is  no  such  terms  as  “memory”,  side  effect,  assignment  statement,  control 
operators  in  NORMA. 

These  characteristics  and  some  other  constraints  (  first  on  the  form  of  index  statements  and  ways 
of  index  domain  specifications)  substantiate  solvability  of  output  program  synthesis  problem  [6,9]. 
Generic  solution  of  this  problem  has  substantial  mathematical  difficulties  (  the  task  may  be  NP-complete  or 
undecidable).On  the  other  hand  researches  on  NORMA  applications  and  design  show  that  the  constraints  are 
acceptable  practically  [15,16]. 

2  Syntax  notation. 


Extended  Backus  -  Naur  form  is  used  in  the  given  syntax  notation. 
{A}^^  {A}\  {A,. _ A„}.[A]  symbols  mean 


0  I  A  I  A  A 
A 


{A}* 

{Ar 


A  A  ... 


{A,.__A„  ::=  A,  | ...  |  A  „ 

}  ::=  0  I  A  , 

[A] 


where  A  -  any  language  object , 

0  -  empty, 

I  -  alternative  choose, 

...  etc. 

Syntax  notions  are  written  in  italics,  but  words  and  symbols  of  common  usage  are  printed  in  a 
usual  way.  As  a  rule,  alternative  constructions  are  arranged  in  columns  (each  construction  on  a  separate 
line). 

Sometimes  half-underlined  syntax  constructions  are  used,  e.g.  name  :_5£L-Syntactically  this  name 
is  identical  to  symbol  name  but  underlined  part  of  the  construction  has  additional  semantic  information. 

Symbol  list-element  substitutes  non-empty  list  of  elements  enumerated  by  comma: 

Wst-element : 

element  {  .element  }* 

The  definition  of  the  element  is  given  in  every  particular  case. 

Usually  syntactical  rules  are  given  first;  commentaries,  semantic  explanations,  examples,  etc. 

follow. 


3.  Principal  language  elements. 


3.1  Lexical  rules. 

Symbols  are  a  base  for  all  the  constructions,  they  are  original  language  elements.  The  number  of 
principal  symbols  is  fixed  and  used  for  the  design  of  any  language  constructions.  The  number  of 
supplementary  symbols  isn’t  fixed  and  settled  by  the  type  of  computer  equipment.  They  are  used  for 
symbolic  constants  formation  of  symbolic  constants  and  data  representation  for  medium. 

3.1.1.  Principal  symbols. 

principal-symbol : 
letter 
digit 

special-symbol 


letter : 


{  A3,C,D,E,F,G,H,UK,L.M,N,0,P,Q,R,S,T,U,V,W,X,Y,Z  } 

{  a,b,c,d,e,f,g,h,ij  k,l,m,n,o,p,q,r,s,t,u,v,w,x,y,z} 

digit : 

{  0,1,2,3A5,6J.8,9  } 

special-symbol : 

special-symbol-not-apostrophe 

apostrophe 

special-symbol-not-apostrophe : 

{  space 
apostrophe : 

i 

space  : 

Space  has  no  graphical  representation. 

3.1.2.  Commentaries, 

The  line  which  begins  from  the  sign  or  the  part  of  line  which  follows  the  sign  is  a  commentary. 

3.1.3.  Tokens, 

There  are  5  classes  of  tokens:  identificators,  key  -  words,  constants,  operation  signs,  delimiters. 

Spaces,  LF,  commentaries  are  considered  tokens’  delimiters  and  have  no  influence  on  the  program 
semantics. 

3. 1 .4.  Identificators, 
identificator : 

letter  {  {  digit,  letter  }  }  * 

Tlie  letters  of  upper  register  differ  from  the  ones  of  low  register.  Identificators  can  have  any  length.  The 
number  of  the  meaningful  symbols  is  defined  in  the  specification  of  the  translator’s  source  language 
(usually  first  6  symbols  are  meaningful). 

3.1.5.  Keywords. 

Tliese  key-words  are  used  in  the  language: 


MAIN  PART 
PART 


VARIABLE 
DEFINED  ON 


ITERATION 

BOUNDARY 


FUNCTION 

REAL 

END  BOUNDARY 

BEGIN 

INTEGER 

INITIAL 

END  PART 

DOUBLE 

END  INITIAL 

RESULT 

MACRO  INDEX 

EXIT  WHEN 

EXTERNAL 

FUNCTION 

DOMAIN 

END  ITERATION 

PARAMETERS 

EXTERNAL 

PART 

INPUT 

FOR 

DISTRIBUTION  INDEX 

OUTPUT 

ASSUME 

INDEX 

ON 

COMPUTE 

3.1.6.  Constants, 
constant : 

arithm-constant 

string 

arithm-constant: 
int<onstant 
real-constant 
double-constant 
int-constant : 

[  digit 

real-constant : 

body  [  E  [{+,-}  ]  power  ] 
double-constant : 

body  [  D  [{+,-}  ]  power  ] 

body : 

int-constant .  int-constant 


power 

int-constant 

string 


'  {  {  symhol-not-apostrophe.  ‘ 
syinhol-not-apostroplie: 
letter 
digit 

special-  symbol-not-apostrophe 
supplementary-symbol 


The  rules  of  writing  constants  are  similar  to  the  ones  in  the  Fortran  language. 

The  type  and  the  value  of  the  constant  is  defined  by  the  way  of  its  representation.  These  are  examples  of 
arithmetical  constants  : 

101  999  0.1  1.0 

10.5E.6  l.OE+7  1.0E7  O.lD-8  15.333D4 

These  are  examples  of  the  strings  : 

‘Hello,  world!’ 

‘ia  “ai  oaea  =  ‘ 

3.1.7.  Signs  of  operations, 
operation : 

{real-operation,  log-operation,  arithm-operation  } 
real-operation: 

{ =,>,<,<=,>=,<>,>< } 

real-except-not-equal-operation : 

{ =,>,<,>=,<=  } 

log-operation : 

{  AND,  OR,  NOT  } 

cu 'it! un-op c i a ti on  : 

{  ) 

Symbols  >=  and  <=  are  used  for  comparisons  >  and  <  ,  <>  and  ><  symbols  -  for  ^  . 

Delimiters, 
delimiter : 

3.2.  Names. 
name  : 

identificator 
Wsl-naine : 

name  {  .name 

Name  is  an  identificator.  It  has  a  panicuiar  meaning  and  defines  a  particular  program's  object  (e.g. 
variable,  domain,  part  of  the  program).  Tliere  can't  be  two  definitions  of  the  objects  which  have  the  same 
name. 

The  rule  of  name  localization  is  true  for  every  program  element  (  part  or  function  ).  The  names  declared 
in  the  program  element  are  localized  in  it.  There  is  no  notion  “  global  variable  “  in  this  language.  The 


names  of  the  functions  or  the  parts  if  they  are  actual  or  formal  parameters  of  other  parts  or  functions  are 
indicated  in  the  external  names  declaration  EXTERNAL  FUNCTION  or  EXTERNAL  PART  [see 
5.16]. 

Half  -  underlined  syntactical  symbols  e.g.  name-set  are  syntactically  equivalent  to  symbol  name. 
Underlined  part  of  the  construction  has  additional  semantic  information. 

Symbol  list-name  substitutes  non-empty  list  of  the  names  enumerated  by  comma. 


3.3  Basic  data  types. 

There  are  basic  data  types  specified  in  the  language: 

integer 

real 

double 

logical 

string 

Value  from  the  set  of  integers  is  assigned  to  the  element  of  integer  data  type.  There  are  arithmetical  and 
comparison  operations  for  integer  type  elements. 

Value  from  the  set  of  leai  numbers  is  assigned  to  the  element  of  real  data  type.  There  are  arithmetical 
and  comparison  operations  for  real  type  elements. 

Value  from  the  set  of  real  numbers  is  assigned  to  the  element  of  double  data  type  and  it  allows  to  use 
double  precious  real  numbers.  There  are  arithmetical  and  comparison  operations  for  double  type  elements. 
There  are  variables  of  integer,  real  and  double  type  in  the  language. 

Value  “true”  and  “false”  is  assigned  to  the  element  of  logical  data  type.  It  is  used  for  definition  of 
condition.  There  are  logical  operations  for  logical  type  elements.  There  are  no  variables  of  logical  type  in 
the  language. 

The  element  of  string  data  type  is  a  set  of  symbols  from  principal  symbol  set  and  supplementary 
symbol  set.  There  are  only  constants  but  no  variables  and  operations  of  this  type. 


3.4.  Expressions. 


3AJ,  Arithmetical  expressions, 
arithm-expression : 

I  {  •  }1  {  arithm-operation  term 

term : 

arihm-constant 
calUfnnction 
name’ index 

name-domain-parameter 
name-scalar 
variable-on-domain 
( arithm-expression  ) 
variable-on-domain : 

name-variable-on-domain [  [list-index-expression]  ] 
const-expression : 

If  }  ]  const-expression-withoiit-sign 
const- expression-without-sign  : 

integer-term  {  arithm-ope ration  integer-term  /* 
integer-tenn  : 
int-constant 
name-domain-parameter 
(  const-expression ) 
index-expression : 

name-index  [{  }  I  const-expression-withoiit-sign  J 

name-index  =  const-expression-without-sign 

name-index  =  name-index  //+,-//  const-expression-without-sign  J 
name-index-construction 

The  result  of  the  index  expression  compulation  in  a  fixed  point  T  of  ilie  domain  (5.1.1.)  is  coordinates 
of  the  point  T\  These  coordinates  can  be  defined  according  to  the  rules  : 

(1)  all  ilic  names  of  the  index  constructions  included  into  the  index  expression  tuv  suhsiiiuted  to  the 
corresponding  lists  of  index  expressions  from  MACRO  INDEX  declaration  ; 

(2)  indexes  are  substituted  to  the  constant  expressions,  values  of  T  coordinates  (taking  into  account 

index  displacement  if  it  is  given)  or  values  of  the  other^iven  indexes  in  the  point  T. 


For  example,  the  list  of  index  expressions  [MyIJ,k=:7,l=i+4]  for  the  point  T  with  /=9y=15,Jt=3,/=37 
(if  we  have  the  declaration  MACRO  INDEX  MylJ  [i-l,j])  gives  us  the  value  of  the  point  T  from 
index  domain  with  5,^=7,/=  13  coordinates. 

The  order  of  arithmetical  expression  computation  is  : 

( 1 )  index  expressions  computation; 

(2)  finding  arguments  (actual  parameters)  of  the  functions  (  including  determination  of  reduction 
functions  operation  domain  [5.2.3]; 

(3)  carrying-out  in-brackets  (  and  )  operations:  if  there  are  several  brackets  of  the  one  level,  they  are 
examined  from  the  left  to  the  right; 

(4)  carrying-out  raising  to  the  **  power  operations:  if  there  are  several  operations  of  the  one  level,  they 
are  examined  from  the  right  to  the  left; 

(5)  carrying-out  of  multiplication  and  division  *  and  /  operations;  if  there  are  several  operations  of  the 
one  level,  they  are  examined  from  the  left  to  the  right; 

(6)  carrying-out  of  sum  and  difference  +  and  -  operations;  if  there  are  several  operations  of  the  one 
level,  they  are  examined  from  the  left  to  the  right. 

The  type  of  arithmetical  expression  result  is  defined  by  the  types  of  operation  results.  The  type  of 
operation  result  can  be  : 

(1)  integer,  if  both  operands  are  integer  : 

(2)  double,  if  even  one  operand  is  double  ; 

(3)  real  in  any  other  cases. 

Constant  expression  is  a  particular  case  of  arithmetical  expression.  The  examples  of  arithmetical 
expressions  : 

AL12[J=2*M-1]+STEPLP*JJ 

Q+1.0D5/((R[Q.1]-R).(R[Q.1]-R[Q+1])) 

CNhcTRAPCSTEP,  2*MV+1,  MMR  on  POLL) 

3.4.2.  Conditional  expressions, 
loi^-expression  : 

[  NOT  ]  logic-term  {  log-operation  logic-term  } 
logic-term  : 
comparison 
( log-expression ) 
comparison  : 

arithm-expression  comp-operation  arithm-expression 


condition-on-domain : 
log-expression 

condition-on- index : 

name-index  rel-except-not-equal-ope ration 

name -index  [  {  +,-  }const-expression-without-sign  ] 
name-index  rel-except-not-equal-operation  const-expression-without-sign 

The  order  of  operations  for  logical  expression  computation  : 

(1)  carrying  out  of  arithmetical  expressions  computation; 

(2)  carrying  out  of  in-brackets  (  and )  operations;  if  there  are  several  brackets  of  the  one  level,  they  are 
examined  from  the  left  to  the  right. 

(3)  carrying  out  of  comparisons  =,>,<,>=,<=,<>,><  ;  if  there  are  several  operations  of  the  one  level, 
they  are  examined  from  the  left  to  the  right. 

(4)  carrying  out  of  logical  negation  NOT  operations;  if  there  are  several  operations  of  the  one  level, 
they  are  examined  from  the  left  to  the  right. 

(5)  carrying  out  of  logical  multiplication  AND  operations;  if  there  are  several  operations  of  the  one 
level,  they  are  examined  from  the  left  to  the  right. 

(6)  carrying  out  of  logical  sum  OR  operations;  if  there  are  several  operations  of  the  one  level,  they  are 
examined  from  the  left  to  the  right. 

There  are  examples  of  logical  expressions: 

ABS(MOD-X1+Y1)<=1.0D-20 

NOT  (II=(2*N+1)*(2*NV+1)  AND  NU=0) 

W+1 


4.  Program  structure. 


program : 

{part}'^ 

pan : 

main-pan 
simple-part 
part -function 
maip-pan : 

MAIN  PART  name-part .  declaration-of-pan 


simple-pan : 


PART  name -simple-part .  declaration-of-part 


part-function : 

FUNCTION  name-function  [type-function] .  declaration-of-fiinction 
declaration-of-part : 

formal-parameters-of-part  BEGIN  body-of-part  END  PART 
formal-parameters-of-part : 

list-name  [RESULT  list-name] 
declaration-of-function ; 

formal-parameters-of-function  BEGIN  body-of-part  END  PART 
formal-parameters-of-function : 

list-name 
body-of-part : 

{ element-of-part)  * 
element-of-part : 
declaration . 
operator . 
iteration  . 

Program  in  NORMA  consists  of  one  or  several  parts.  The  parts  may  be  of  three  types  -  main  part,  simple 
pan  and  pan-t'unciion  (corresponding  to  key-words  MAIN  PART.  PART,  FUNCTION).  Parts  can  call 
each  other  by  name  and  communicate  data  with  help  of  formal  and  actual  parameters  or  through  external 
files  with  help  of  INPUT  and  OUTPUT  declarations. 

Localization  rule  is  true  for  each  part:  the  names  declared  in  the  part  are  localized  in  it.  There  are  no 
global  variables  in  the  language. 

The  main  part  must  be  in  the  NORMA  program  and  be  the  only  one  (it  has  no  formal  parameters.  Main 
part  calls  and  recursive  calls  are  prohibited. 

There  is  name  of  the  pan.  text-commentary  (if  you  need)  and  the  list  of  formal  parameters  in  the  part 
header .  Formal  parameters  must  be  declared  in  the  body  of  part  with  help  of  declaration  on  domain,  scalar 
declaration,  domain's  parameters  declaration  and  declaration  of  external. 

Parameters-variables  declared  before  the  key-word  RESULT  in  the  list  of  formal  parameters  are  initial 
data  for  computations  specified  in  the  part,  the  parameters  declared  after  are  the  results  ot  computations.  One 
and  the  same  parameter  can't  be  initial  and  result  at  the  same  time  (it  causes  reas.signment  which  is 
pr(’>hibited  in  NORMA).  The  key-word  RESULT  isn't  used  in  part-function  (the  result  of  computations  is 
connected  with  the  name  and  the  type  of  function). 

There  are  definitions,  operators  and  iterations  in  the  body  of  part.  In  general  their  order  is  arbitrary, 
possible  constraints  are  defined  in  translator’s  source  language  description). 


It  is  an  example  of  main  part  header : 

MAIN  PART  Linear .  !  Variant  of  May  25  1992 : 

!  ===:  linear  approximation  ======= 

!  the  case  of  axial  symmetry  for  M=1 ; 

!  computation  is  being  done  on  Beta  angle  for  the  point  1=0  ; 

!  further  for  I=l,2i^N  values 
!  are  being  extended  on  symmetry  for  V,FI,DAyL  . 

BEGIN 

body  of  part 

END  PART 

It  is  an  example  of  simple  part  header  ; 

PART  IntFKP . 

!  Computation  of  integral  from  FKP  function 
!  with  interpolation  of  integral  function 
BETA»ALPHA,  !  -  node  points 
ALL,ALC^LR,  !  -  points  for  interpolation 
STEP,  !  -  integration  step 
FKP  !  -  name  of  integral  function 
1  -  result  IntResult 

RESULT  IntResult 

BEGIN 

EXTERNAL  FUNCTION  FKP  DOUBLE. 

VARIABLE  ALPHA,BETA.ALL,ALK.ALR.STEP, IntResult  DOUBLE  . 

body  of  pan 

END  PART 

5.  NORMA  constructions. 

We  can  divide  Norma  constructions  into  declarations,  defining  program  objects  (e.g.  domains,  indexes, 
variables  )  and  constructions  for  computation  rules  specification. 

5.1  Declarations 

declaration : 

dcclaration~of  domain 

dcclaration-of-domaindnde.xes 

deciaration-of-scalar-variables 

declaratiothof-vanables-on-donmns 

declaration-of-ifidex-constructions 

declaration-of-distribution-indexes 


declaration-of-donmin-parameters 
declaration-ofinput 
declaration-of‘Output 
declaration-of  external 

Objects  which  can  be  declared  in  NORMA  are:  domains,  scalar  variables  (scalars),  variables  specified  on 
grid,  index  constructions,  distribution  indexes,  domain  parameters,  input  and  output  variables,  names  of 
external  functions  and  parts. 

5.1.1  Declarations  of  domains, 
declaration-of-domain : 

declaration-of  unconditional-domain 
decloration-of  conditional-domain 
declaration-of-unconditional-domain 
declaration-of-rectangular-domain 
declaration-of-diagonal-domain 
domain : 

new-domain-without-name 
name-domain 
unconditional-domain : 

new'-domain-wafioiit-name 
name- unconditional-domain 
name-domain  : 

name-unconditional-domain 
name-conditional-domain 
name -unconditional-domain  : 
name-  rectan  u  la  f -domain 
nanie-diaitonal-doniain 

The  notion  domain  was  introduced  in  NORMA  for  index  space  representation.  Domain  is  a  complex  of 

integers  numbers  sets  (ij, . i„  },n>0,  ij>0,  j-\ . n,  each  set  gives  coordinates  of  the  point  from  n- 

dimcnsional  space.  Unique  name  -  index  name  is  connected  with  one  of  the  directions  (coordinate  axes  )  of 
the  /7-diiTiensional  space. 

The  domain  defines  coordinates  values  of  index  space  points,  but  not  the  values  of  the  calculated 
variables  in  these  points.  For  example  if  you  need  to  calculate  the  value  of  the  variable  F//,  if  =  1 . n  on 

some  grid  if  =1,...W,  which  was  introduced  during  task  solution  (e.g.  by  formula  X^j  ^F(hff),  F- 
given  function,  h  -  given  parameter)  you  should: 


(1)  define  the  domain  consists  of  points  ij  = 

(2)  define  variables  X and  Y\ 

(3)  set  the  rule  of  grid  Xfj  computation  :  Xij  =  F(hJJ)  and  the  rule  of  Yjj  values  computation  Ytj  = 
G(Xij )  (jF,  G,  h  are  considered  to  be  given  somehow). 

Domain  in  NORMA  can  have  a  name.  The  operations  of  modification  and  multiplication  are  defined  on 
the  domains.  Domain  indexes  aren’t  specially  declared,  they  are  introduced  in  domains’  declaration.  Domain 
can  be  conditional  and  unconditional  Conditional  domain  consists  of  the  index  space  points.  The  number 
and  the  coordinates  of  these  points  can  change  depending  on  fulfilment  or  non  fulfilment  of  conditions  on 
domain.  Unconditional  domain  consists  of  the  points  from  index  space,  which  number  and  coordinates  may 
be  defined  during  the  translation. 

There  are  two  different  terms  in  NORMA  declaration  of  domain  (named  conditional  or  unconditional 
domain)  and  use  of  domain  (syntactically  it  is  the  name  of  domain  or  new  domain  without  name). 
Domains  are  used  in  declarations  of  the  variables  declared  on  domain,  setting  computation  domain  in 
ASSUME  operators,  input  and  output  variables  declarations,  setting  domains  of  actual  parameters  in  part 
or  functions  calls,  in  reduction  functions. 

5. 1 . 1 . 1  Declaration  of  unconditional  domain. 

declcinition-of- rectangiiiar-domain : 
multidimensional-domain 
new-domain 

multidimensional-domain : 
one-dimensional-domain 

I  name-multidimensional-domain  :  ]  ( domain-product ) 

domain-product : 

component-domain  {  ; component-domain  }* 
component-domain  : 
multidimensional-domain 
name-unconditional-domain 

oncdimensional-domain  : 

I  name-onedimensional-domain  :  j  f  iiame-index^valiu'  ) 

value  : 
ratige 

const-e.xpression 


range : 


const-expression ..  const-expression 
new-domain : 

[  name-new-domain:  ]  new-domain-without-name 
new-domain-without-name : 

name-unconditional-domain  /list-modification 
modification : 

name-index=value 

name-onedimensional-domain  {  {  }  boundary-function}^ 

boundary-function : 

LEFT (const-expression ) 

RlGWTiconst-expression) 
name-recTangular-domain  : 

name-onedimensional-domain 
name-multidimensional-domain 
name-new-domain 
declaration-of-diagonal-domain : 

name -dia gonal-domain  :  name-unconditional-domain  /  Wsi-condition-on-index 
The  notion  of  one-dimensional  domain  is  a  key  notion  in  rectangular  dimensions  declaration.  One¬ 
dimensional  domain  is  used  for  setting  the  range  of  points  on  some  coordinate  axe  in  the  index  space.  In  the 
simplest  case  the  name  of  one-dimensional  domain,  index  name  and  the  boundaries  of  index  values  changing 

are  declared  in  one-dimensional  domain  declaration. 

RegionK:  (k  =1..1S) . 

Name-onedimensional-domain  can  be  used  for  reference  to  this  domain. 

The  name  of  index  is  an  index  variable.  The  set  of  its  values  is  defined  by  the  range 
const-expression  ..  const-expression.  The  boundaries  of  the  range  are  integer  positive  constant  expressions 
built  from  integer  constants,  domain's  parameters  and  arithmetical  operations.  Particular  values  must  be 
assigned  to  the  -domain’s  parameters  in  the  declaration  of  domain’s  parameters.  The  value  of  the  low 

boundary  mustn’t  be  more  than  the  value  of  upper  boundary.  For  example. 

RegionK:  (k  =  1..15) . 

Multidimensional  domain  is  built  by  operation  of  rectangular  domains  multiplication. 

It  is  an  example  of  two-dimensional  domain  declaration.  This  domain  is  built  by  the  multiplication  ol 

one-dimensional  domains  AxisK  and  AxisL: 

Square:  (AxisK:  (k=l,..lS)  ;AxlsL:  (l=l...5)  ) . 

Domain  Square  may  be  defined  in  different  ways  (  e.g.  taking  into  account  the  previous  definition  ): 
Square:  (RegionK:AxisL:  (I=1...5)  )  . 


The  operation  of  rectangular  domains*  multiplication  has  an  ability  : 

A;B=B;A  if  A  and  B  are  domains. 

It  means  that  the  order  of  the  index  space  directions  in  domain’s  declaration  isn’t  fixed  (or  fixed  arbitrarily, 
that  is  the  same  for  user ).  If  the  order  of  index  space  directions  is  important  (e.g.  consistency  of  directions 
is  needed  for  using  the  same  variables  on  the  same  domains  in  the  different  parts)  it  may  be  set  by  INDEX 
declaration  (5.1.2). 

If  the  domains  are  operands  of  domains’  multiplication  operation  they  must  have  different  index  names. 
The  issue  of  rectangular  domain  modification  may  be  in  addition  of  some  points,  deletion  of  the  points 
or  changing  the  range.  The  modification  of  the  first  two  types  is  defined  by  the  boundary-functions 
LEFT(n)  and  RIGHT(n).  Function  LEFT  is  applied  to  the  left  boundary  of  the  range  and  function 
RIGHT  -  to  the  right  one.  Sign  “+”  means  that  the  points  are  added  to  the  one-dimensional  domain,  sign 
means  that  they  are  deleted.  Both  functions  have  one  parameter  n  which  defines  the  number  of  the 

points.  Integer  positive  constant  may  be  an  actual  parameter  of  the  function  only.  Call  to  the  functions  is 
allowed  only  in  the  context  with  the  name  of  one-dimensional  domain  setting  modificated  range: 
flame -onedimensional  domain  +LEFT  (n) 

or 

name-onedimensional  <iomam+R I G H T ( n ) . 

E.g.  the  declaration 

FlushK:  Squarc/.\xisK-LEFT(2)+RIGHT(2) . 

defines  domain  FlushK  which  consists  of  points  fc=3...I7. 

The  boundaries  of  the  resulting  range  (after  using  LEFT  and  RIGHT  functions)  must  be  positive 
integer  and  the  left  boundary  mustn’t  exceed  the  right  one. 

Declaration 

Square!:  Square  /  Axi5K+Left(l)-Right(2) . 

isn’t  right  because  the  left  boundary  of  the  resulting  range  ^=0...13  isn’t  positive. 

You  can  change  component  one-dimensional  domain  by  the  explicit  redeclaration  of  the  ranges.  For  this 
purpose  index  name  of  direction  and  its  new  value  must  be  declared  in  the  modification. 

E.g. 

Ncwsquare:  Square  /  Axils K+RIGHT(3>,  L=50...80. 

If  the  list  of  modifications  has  more  than  one  clement  modifications  dccliuvd  in  tliat  list  act  on 
modificated  domain  in  writing  order  (from  the  left  to  the  right). 

Besides  rectangular  domain  declarations  in  NORMA  there  is  a  possibility  ol  clnn^oudl  domain  settings  by 
the  superposition  of  conditions  on  previously  declared  domain  (rectangular  or  diagonal). 

Diagonal  domain  is  defined  by  conditions  on  indexes  of  some  previously  declared  domain  D .  It  consists 

of  domain  D  points  in  which  the  conditions  are  true.  E.g. 

KL:  ( ;  (l=1..10) ) .  Diagonal:  KIV  k=l. 


set  domain  Diagonal  consists  of  the  points  (A:=l,  /=!),  (k-l,  /=2),  ...,(^=10,  /=10). 

Indexes  on  variables  used  in  the  notation  of  conditions  on  indexes  can  be  either  internal  (from  D 
domains  set  of  indexes)  or  external  (5.2,3.2), 

The  valid  form  of  conditions  on  indexes  defines  statically  the  points  of  diagonal  domain  (it  is  also 
possible  for  rectangular  domain).  That’s  rectangular  and  diagonal  domain  are  static  objects.  They  differ  from 
conditional  domain  described  below. 

5. 1 . 1 .2  Declaration  of  conditional  domain. 

declaration~of-conditional~domain : 

name-conditional  domain  ,  name-conditional-domain  : 
name-domain  /  condition-on-domain 

Besides  static  domains  in  NORMA  you  can  set  conditional  domain  consists  of  the  points  from  index 
space.  Their  number  and  coordinates  can  change  depending  on  fulfilment  or  nonfulfilment  of  conditions  on 
domain. 

Here  we  describe  how  you  can  set  conditional  domain.  Previously  declared  domain  D  is  divided  into  two 
disjoint  subdomains  D1  and  D2.  The  first  subdomain  consists  of  the  points  where  the  conditions  on 
domain  are  true,  the  second  one  -  where  they  are  false.  Besides  D1  U  D2  =  Dl,  D1  fl  D2  =  0  . 

E.g.  set  domains: 

Domain:  ( ( i=2..n) )  ;  ( j=l..m  )  ) . 

Domainl.Domainl:  Domain  /  x+y[  i-1,  j  ]  -  '4  j+1  ]  >  0  . 

This  declaration  defines  division  of  initial  domain  Domain  into  domains  Domainl  and  Domain!  and 

Domainl  U  Domain!  =  Domain,  Domainl  fl  Domain!  =  0  . 

Domainl  consists  of  the  points  of  Domain  in  which  the  condition  X-^y[i -\J  ]  -  Z[j  +1  ]  >  0  is  true. 
DomainZ  -  where  the  condition  is  false. 

Indexes  of  the  variables  used  in  the  conditions  notation  can  be  either  internal  (from  D  domains  set  of 
indexes)  or  extemal(5.2.3.2). 

Program  fragment  defining  conditional  domains  Bf2Pl  and  BfNot2Pl.  and  their  further  division  into 
smaller  domains  (first  domain  BfNot!Pl  is  divided  into  two  conditional  domains  Nodes  and  NotNodes,  and  then 
domain  NotNodes  is  divided  into  two  conditional  domains  DomainTnie,  DomainFalse)  is  given  below  : 

DOMAIN  PARAMETERS  N=3,  NV=3  . 

Bnu:  (Nu=()..!’"N).Bf:  (  II=0..(2’^N+1)  *  (!*NV+1  ) .  BfNii:  (Bf;Nu) , 

B!!PI.BfNot2PI:  BfNu  /1I=(2’"N+1)*(2*NV+!  )  AND  NU=<). 

Nodcs.NotNodcs:  BfNot!PI  /  ABS  (BTNodcs  -BT  )  <  0.0001. 

DomainTruc.DomainFaLsc:  Nodes  /  .ABS(BTNu  -  BTNodcs  )  <  0.0001. 

VARIABLE  BT,  BTNodes  DEFINED  ON  Bf  DOUBLE  . 

VARIABLE  BTNu  DEFINED  ON  Bnu  DOUBLE  . 


5.1.2  Declaration  of  domains*  indexes, 
declaration-qf-domaindndexes : 

INDEX  list- name ’index 

The  order  of  the  index  space  directions  in  domain^s  declaration  isn't  fixed  (or  fixed  arbitrarily,  that  is  the 
same  for  user ).  If  the  order  of  index  space  directions  is  important  (e.g.  consistency  of  directions  is  needed 
for  using  the  same  variables  on  the  same  domains  in  the  different  parts)  it  may  be  set  by  declaration  of 
domains*  indexes.  The  order  of  index  space  directions  is  the  same  as  the  order  of  index's  names  enumeration 
in  declaration  INDEX  (  from  the  left  to  the  right ). 

We  consider  call  of  part  B  from  part  A  with  variable  X  values  communication  from  A  to  B,  Variable 
X  is  defined  on  SquarelnA  domain: 

PART  A, 

BEGIN 
INDEX  M. 

SquarelnA.:  (AxisK:  (k=il...5)  ;AxisL:  (I=1..10)  )• 

VARIABLE  X  DEFINED  ON  SquarelnA. 

COMPUTE  B(  X  ON  SquarelnA). 

END  PART 
PART  B.  X 
BEGIN 
INDEX  ij. 

SquarelnB:  (AxisK:  (i=1..5)  ;  AxisL:  (j=1..10)  )  . 

VARIABLE  X  DEFINED  ON  SquarelnB. 

END  PART 

Declarations  INDEX  k,i  and  INDEX  i,j  make  correspondence  between  indexes:  i-kj'-l  .  In  other 
words  the  values  of  X  variable  are  considered  in  the  one  and  the  same  way  in  both  parts.  If  INDEX  i,j 
declaration  from  part  B  is  substituted  to  INDEX  i,j  the  values  on  corresponding  directions  will  be 
inconsistent:  5  f  10,  10  f5. 


5.1.3  Declaration  of  variables, 
dec  laratiou’of  scalar-variables : 


VARIABLE  list-name-scalar  f  type  ] 


declaration-of-variables-on-domains : 

VARIABLE  Wst-definition-variables-on-domain  [  type  ] 
definition-variables-on-domain : 
list’ name -variable -on-domain 
DEFINED  ON  unconditional-domain 
type  : 

/REAL,  INTEGER,  DOUBLE/ 

Scalar  variables  and  variables  on  domain  are  arithmetical  variables.  There  is  a  unique  name  and  the  type 
of  variable  in  the  declaration  corresponding  to  every  arithmetical  variable.  The  types  of  variables  are 
REAL,  INTEGER  or  DOUBLE  (default  REAL). 

It  is  an  example  of  scalar  ‘s  declaration  : 

VARIABLE  Alpha,  X,  H.  VARIABLE  UK,  Z  INTEGER, 

Every  variable  on  domain  is  connected  with  the  domain  in  the  declaration.  This  domain  defines  index's 
names,  which  can  be  used  in  index  expressions  in  calls  to  this  variable  (  the  order  of  index  expressions 
isn’t  important).There  is  no  special  declaration  for  indexes,  they  are  introduced  in  domain’s  declarations. 
Square:  (AxisK:  (k=:1..5)  ;  AxisL:  (1=:1..5)  )• 

VARIABLE  First, Last  DEFINED  ON  Square, 

SingleK  DEFINED  ON  AxisK,  SingleL  DEFINED  ON  AxisL. 

Given  declarations  define  variables  First,  Last  on  domain  Square.  It  means  that  these  variables  can  have 
corresponding  indexes  k  and  1  in  the  index  expressions.  Variables  SingleK  and  SingleL  are  defined  on  domains 
AxisK  and  AxisL.  It  means  that  they  can  have  corresponding  indexes  k  and  1  in  the  index  expressions.  In  this 
call  First[k-l,l+l]  and  First[l+l,k-l]  are  equivalent  (5.1.2). 

In  call  to  variable  on  domain  the  rule  of  default  indexes  setting  is  performed.*  index  expressions  which 
coincide  with  index  name  may  be  omitted.  E.g.  notations  First[kJ],  Firstfk],  First[l],  First  for  variable  First  are 
equivalent. 

If  some  index  is  set  by  constant  (constant  expression)  you  must  declare  which  index  direction  this 
constant  concerns  to  ,  e.g.  First[k=5.1-l].  If  you  need  to  link  two  directions  by  some  formula  you  must  do  it 
in  explicit  way,  e.g.  diagonal  elements  of  the  matrix  First  may  be  defined  as  Firsifk.i=kl  or  Firstfl=k]. 

5.1.4.  Declaration  of  index  construction, 
declaration-ofinde.x-constriiction : 

MACRO  INDEX  name-inde.x-constrnction  \  Wsi-explicit-index-expression  ] 
explicit- index- expression : 

name -index  [{+.-)  const-expression  ] 


name-index  -  const-expression 

name-index  -  name-index  [{+,“}  const-expression  ] 


Index  construction  is  used  for  shortening  notation  of  complicated  index  expressions.  It  is  the  simplest 
case  of  macrodeclaration,  e.g.  here  are  the  declarations  of  index  constructions  Short  and  SecondShort  and  the 

declaration  of  domain  UKL  and  variable  V. 

UKL  :(  ( ;  (  k=3...17  )  ;  ( l=:3...18  ) )  . 

VARIABLE  V  DEFINED  ON  UKL  DOUBLE. 

MACRO  INDEX  Short  [  i-1,  j-1,  k+1  ]  • 

MACRO  INDEX  SecondShort  [  i-1,  j-1,  k+1,  1+2  ]. 

This  declarations  allow  notations  of  v[Short,  1-2  ]  or  v[  l-2,Short  ]  type  for  setting  variable  v[  i-1,1-2, 
j-1,  k+1  ]  and  V[  SecondShort  ]  for  setting  variable  V[  i-1,  1+2,  j-1,  k+1  ]. 

5.1.6.  Declaration  of  domain  parameters, 
declaration-of-domain-parameters : 

DOMAIN  PARAMETERS  Wst-prescription 
prescription : 

name-domain-parameter^int-const 

Range  boundaries  in  domain’s  declaration  may  be  set  by  parameters  of  domain  (  in  implicit  way  ).  The 
values  of  tiicse  parameiers  must  be  defined  in  the  part,  in  the  domain  parameters  declaration.  E.g. 

DOMAIN  PARAMETERS  M=3,  J=90,  DomainLeftBoundParameter=15. 

Parameters  of  domain  may  be  included  into  arithmetical  expressions  and  constant  expressions. 

5.1.6  Declaration  of  input  and  output  parameters, 
dec  laration-of -input : 

INPUT  list-input-scalar 
INPUT  list-input-on-domain 
input-scalar : 

name-scalar  f  attributes  J 
input-on-domain  : 

name-vanahlc-on-domain 
deckiration-of-output : 

OUTPUT  list-ouTput-scaiar 
OUTPUT  list-outpnts-on-domain 
output-scalar : 

name-scalar  f  attributes  ] 


outputs-on-doniain : 

\ist^outpiit-on-domain  [  attributes  ]  ON  domain 
output-on-domain : 

n/ime-variahle-on-domain 

attributes  : 

( list-attribute  ) 
attribute : 

STRf  int-constant ) 

TAB(  int-constant ) 

SPACEf  int-constant ) 
string 

ORDERdist-name-index) 

LENGTH- int-constant 
FILE=  'file-name  ’ 

ALL 
format 
format : 

I  int-constant 

{  F,E.D  }  int-constant .  int-constant 

Declaration  of  input  (output)  variables  means  that  all  the  variables  mentioned  in  the  list  of  input 
(output)  are  liable  to  input  (output).  E.g.  declarations 
B/Z<Eps. 

INPUT  Velocity  ON  A.  OUTPUT  Tau  ON  Bl. 

INPUT  X,  ALPHA. 

are  requests  for  input  of  scalar  X,  ALPHA  values  ,  value  of  Velocity  in  all  the  points  of  Bl  domain  where 
condition  Z<Eps  is  fulfilled. 

The  order  of  variables  input  (output)  isn't  set,  it  is  defined  during  the  translation  .Minimal  piece  of 
information  which  is  input  (output)  as  a  whole  is  scalar  or  variable  on  domain.  Attributes  may  control 
input  (output)  of  these  elements. 

.Attributes  act  on  the  variable  (list  of  variables)  declared  before.  E.g.  declaration 
INPUT  RLR2  (FILE  =*myrile’)  ON  Grid. 

is  a  request  for  input  of  variables  R1.R2  values  on  Grid  domain  (  in  other  words  for  all  the  index  values  ot  ^ 
this  domain)  from  file  myfile.dai. 

Attributes  help  to  control  the  form  of  input  and  output  data  and  bind  data  with  input  and  output  files: 
i‘  STR(n)  sets  the  skip  of  n-l  lines; 


V  TAB(n)  sets  n  spaces  before  the  beginning  of  the  line; 

i  SPACE(ii)  sets  n  spaces  in  the  line  begging  of  the  current  position; 

1  'string'  set  output  of  the  textual  constant  strings 

i  ORDER  defines  the  order  of  index  changing  for  input;  the  highest  index  is  the  first  from  the  right, 
the  lowest  is  the  first  from  the  left;  for  one  value  of  every  high  index  the  low  index  has  run  over  all  their 
values  ; 

I  LENGTH  sets  the  length  of  record  in  the  output  file; 

1  FILE  sets  the  name  of  input  or  output  files;  the  method  of  setting  file  name  is  defined  during 
realization  (  if  you  use  translator  version  for  personal  computers  input  file  has  a  default  name  with  .dat 
extension;  when  you  set  file  name  this  extension  may  be  omitted;  if  attribute  FILE  isn’t  given  input  is 
carried  out  from  file  nonna.dat  and  output  to  display  screen  ); 

1  ALL  sets  output  variable’s  name  and  its  coordinates  (index  values)  for  every  value  of  variable  on 
domain. 

1  fonimt  sets  format  for  numerical  values  for  input  and  output  and  corresponds  to  format  specificators  I , 
F,  E,  D  in  the  FORTRAN  language.  There  are  default  formats  :  E>15.8  for  variables  of  REAL  type, 
DI5.8  for  variables  of  DOUBLE  type  15  for  variables  of  INTEGER  type. 

For  input  variables  only  attributes  of  format  and  FILE  type  are  allowed. 

Here  is  an  example  of  input  variables  declaration  : 

INPUT  One  {FILE=‘fiIer,F10.2)  ON  Gridl, 

Two(FILE=:‘fiIe2’  )  ON  Grid2. 

Here  are  examples  of  output  variables  declarations: 

OUTPUT  VeIocity(STR(5)  ,TAB(7),  "Velocity  =  "  ,F9.l)  . 

OUTPUT  X(FILE=‘FILE17’  ,  ORDER(J,KJ),  ALL,  fS.l)  ON  Grid. 

OUTPUT  Y(‘Matrix*,SPACE(10),’of  values’^PACE(10),"Y  parameter’, 

FILE=t‘OT5’  ,ALL,F15.2)  ON  ABC. 

For  output  in  more  complicated  form  (  diagrams,  tables,  etc.)  you  should  use  standard  libraries  and 
packets  tools  or  your  own  programs  written  in  other  languages.(5.2.5). 

Syntax  of  input  data  files: 

flic  : 

inpitr-clcnicnr  (inpuhclcmcnr 

inpiir-clcinent  : 

mnne -scalar  -arithm-i 'onstant: 

/  mwe-yariahic-on-domwU  list-//  uicx-  range)-data: 

mdex-range : 

name-index  =  int-constant ..  int-constant 


data: 


list-data-element 
data-element : 
arithm-constant 
int-constant( data) 

Prograinmer  can: 

V  place  input  elements  in  any  order, 

V  control  the  order  of  numerical  values  in  file  by  changing  index  order, 

V  not  keep  to  one  format  in  numerical  values  notation, 

V  write  repeated  data  in  a  short  form. 

Here  is  an  example  of  input  data  file’s  contents: 

C(K=1..10)=5(-10.1)  ,5(1.01); 

ALPHA=3.17;  BETA=-0.12;  GAMMA=1.0E-10 
C(K=11..20)=5(10.1,5(-1.01); 

Here  is  an  example  of  input  data  file  data.dat  contents  for  Gauss  program: 
MAIN  PART  Gauss. 

Solution  of  linear  equations  by  Gauss-Jourdan  method. 

BEGIN 

Ot;(t=0..n).  Oi:(i=l.:n).  Oj:(j=l..n). 

Oij:(Oi:Oj).  Otij:(Ot;Oij). 

Oti:(Ot:Oi).  Otiil:Otij  /  t=l..n.  OtilrOti  /  t=l..n. 

DOMAIN  PARAMETERS  n=5. 

VARIABLE  a  ON  Oij.  VARIABLE  m  ON  Otij. 

VARIABLE  b,  ON  Oi.  VARIABLE  r  ON  Oti. 

INPUT  a(FILE=‘data’)  ON  Oij,  bfFILE=‘data’)  ON  Oi. 

OUTPUT  x(FILE=‘rcsults’,ALL)  ON  Oi. 

FOR  Otij/t=0  ASSUME  m=a. 

FOR  oti/t=o  ASSUME  r=b. 

OtiEQtijl.OtiNEtjliOtijl  /  i=t.  OiEQtil,OiNEtil:Otil  /  i=t. 

FOR  OtiEQtjl  ASSUME  m  =  m[t-l.i=t]/m[t-l,i=td=t}. 

FOR  OiEQjl  ASSUME  r  =  r[M,i=t]/m[M.i=t,j=t]. 

FOR  OiNEtjl  ASSUME  m  =  m[t-l]-m[t-l.j=t],m[i=t]. 

FOR  OiNEtil  ASSUME  r  =  r[t-l]-m(t-l.j=t].m[i=t]. 

FOR  Oi  ASSUME  x  =  r  lt=n]. 

END  PART. 

Input  file  data.dat: 

B(I=1.J)=5.0,  13.0,  3.0; 

Aa=1..5J=1..5)=2.0.  3.0,  -4.0,  5.0,  -1.0, 

3.0,  4.0,  -1.0,  6.0,  1.0, 


2.0,  0.0,  -3.0,  0.0,  4.0, 

0.0,  2.0,  0.0,  0.3,  0.0, 

3.0,  -1.0,  2(0.0),  1.0; 

B(I=4..5)=  5.0,  3.0; 

The  values  of  data  multidimensional  arrays  are  placed  in  the  file  corresponding  to  the  given  indexes. 
The  values  of  the  first  index  from  the  right  are  changed  the  first :  in  the  given  example  matrix  A(I,J)  is  set 
on  rows,  in  other  words  7=1,  J  =  1.2,3.4,5,  then  7=  2,  J  =  1,2,3,4,5  etc.  till  7=5, 7=  1,2.3, 4,5. 

The  way  of  input  file  data.dat  setting  isn’t  the  only  one. 

5.1.7  Declaration  of  external  names, 
declaration’Ofextemal’names  : 
declaration-of-extemal‘junctions 
declaration-of  external-parts 
declaration-of-extemal-functions : 

EXTERNAL  FUNCTION  lisUname- exte mal-fun ction  [  type  ] 
declaration-of-extemal-parts : 

EXTERNAL  PART  list-nam  e -extemal-simple-part 

If  the  names  of  the  functions  or  pans  are  their  actual  or  formal  parameters  they  are  indicated  In  the 
declaration  of  external  names  by  all  means  .  Default  type  of  external  function  is  REAL. 

EXTERNAL  FUNCTION  Last, First  DOUBLE. 

EXTERNAL  PART  Middle. 


5.1.8.  Declaration  of  distribution  inde.xes, 
declaration-of-distribiition-indexes : 

DISTRIBUTION  INDEX  name -index  -  simple-ran^e  [,  name-index  =  simple-range  ] 
simple-range : 

int-constant[ ..  int-constant] 

Declaration  of  distribution  indexes  is  used  for  representation  of  two  index  directions  from  task  domain 
index  space  on  processor  element  (PE'  matrix  of  distributed  system.  If  functions  and  parts  liave  this 
declaration  they  are  called  distributed,  if  there  is  no  such  declaration  in  their  content  they  arc  called 
nondistrihuted  .systems.  You  can  meet  litis  declaration  in  the  distributed  pan  or  function  only  once;  it  is 
prohibited  in  the  main  part  in  other  words  the  main  part  is  always  nondistrihuted. 

Given  declaration  results  in  data  and  control  distribution  between  PE  elements  of  the  system  or 
automatic  generation  of  data  exchange  between  PE  if  you  need  it.  The  subject  of  distribution  is  variables 
which  indexes  coincided  with  ones  in  declaration  DISTRIBUTION  INDEX.  E.g.  if  there  is  a  declaration 


DISTRIBUTION  INDEX  i=:2..8, 


all  the  variables  defined  on  the  domains  with  /  and  j  indexes  will  be  distributed  on  PE/j  with  virtual  row 

numbers  /=2..8  in  the  column  number y=l  1  of  PE  matrix. 

Computations  defined  in  nondistributed  part  or  function  are  carried  out  as  a  whole  in  one  PE  (  though 
there  may  be  several  such  PEs). 

Declaration 

DISTRIBUTION  INDEX  i=:2...8,j=0. 

is  false:  the  elements  of  PE  matrix  are  considered  to  be  numbered  from  1 . 

5,2  Operators  in  NORMA. 

operator : 

scalar-operator 
operator- ASSUME 
call-part 

There  are  three  defined  types  of  operators  in  NORMA.  They  are  scalar  operator,  operator  ASSUME  and 
call  part.  The  operators  are  used  for  description  of  computations  for  task  solution. 


5.2.1  Scalar  operator, 
scalar-operator: 

name-scalar  =  scalar-arithm-expression 
scalar-arithm-expression : 
arirhm-expression 

Scalar  operator  is  used  for  computation  of  scalar  arithmetical  values  .  In  fact  it  is  an  analogue  of 
assignment  statement  in  traditional  programming  languages.  Name  of  scalar  is  indicated  in  the  left  part  of 
the  operator  and  scalar  arithmetical  expression  built  in  usual  way  from  scalars,  arithmetical  constants, 
domain’s  parameters,  function  calls,  variables  on  domain  with  index-constants. 

Variables  defined  on  domain  which  index  expressions  are  not  constants  can’t  be  included  into  scalar 

arithmetical  expression  (  exception  from  this  rule  is  arguments  of  reduction  functions  (5.2.3) ). 

U:  I  (  i=l..MaxI)  ;  (j=l..Max.I  )  >. 

DOMAIN  PARAMETERS  Maxi  =  3,  Max.I  =  90. 

VARIABLE  ScaiarV  INTEGER.  VARIABLE  Pi  DOUBLE. 

VARIABLE  ArrayV  DEFINED  ON  I.T  DOUBLE. 

ScalarV  =  MaxI.(Max.T.l)/2+SQRT(PI)/SIN(  Array  V[i=l.j=551). 


Reassignment  in  scalar  operator  is  prohibited,  that’s  the  next  operator  is  false: 

ALPHA  =  ALPHA-1. 

5.2.2  ASSUME  operator, 
operator- ASSUME 

FOR  domain  ASSUME  relation! ; relation)* 
relation : 

name-variahle-on-domain  =  arithm-expression 
call-part 

ASSUME  operator  is  used  for  computation  of  arithmetical  variables  defined  on  domains. 

The  case  of  part  call  in  the  body  of  ASSUME  operator  is  described  in  5.2.4. 

The  terms  of  arithmetical  expression  in  the  right  part  of  relation  may  be  variables  defined  on  domain, 
scalars,  arithmetical  constants,  domain’s  parameters,  function  calls,  indexes. 

Informally  ASSUME  operator’s  semantics  is  defined  in  the  way  described  below.  Let’s  consider  given 
relation 

FOR  ASSUME 

where 

“  D(/; . L)  -  .ASSUME  operator's  domain. 

-  -  D  domain  indexes, 

-  ,  \<q<k-  variables  names  defined  on  domain, 

-  p  <n  -index  expressions  of  the  left  part 

-  -  index  expressions  of  the  right  part, 

-F  -  function  computed  in  the  right  part, 

-  Other  -  other  terms  of  the  right  part. 

Every  relation  set  the  rule  F  oi  variable  from  the  left  part  values  computation  based  on  the  values 
of  X  ‘^ . X  ’'  variables  and  the  terms  Other  from  the  right  part; 

(1)  all  the  points  (a, . D(/, . /„)  anedellned: 

(2)  .value  of  .V’  from  the  left  part  is  computed  in  huiL{u. . c/,. )  point  for  cvcit  poini 


(3)  index  expressions  IndR^''  (a, , ...  ,a„ )  of  all  the  variables  X^'' ,  \<q<k  from  the  right  part 

of  relation  are  computed  for  every  point  (a, ,... ,  )  €  D  and  the  set  of  right  part  arguments  is  defined 

. '^9^* 

(4)  if  for  some  period  of  time  for  point  (ap...,a„)€  Dali  the  arguments  from  X  have  been 


computed  the  computation  of  value  from  the  left  part  is  possible,  but  if  all  the  arguments 

haven’t  been  defined  the  computation  in  that  point  in  that  period  of  time  is  impossible  (it  doesn’t  mean  that 
it  is  impossible  in  all  cases). 

The  name  of  ASSUME  operator  underlines  the  fact  that  it  defines  the  rule  of  value  computation  in  the 
only  way  but  doesn’t  require  immediate  computation  in  particular  place  of  program  and  doesn’t  arrange  the 
order  and  the  method  (parallel  or  sequential)  of  computation. 

E.g.  if  the  variable  is  defined  as 
Matrix:  (  (I=1..5)  ;  ). 

VARIABLE  DEFINED  ON  Matrix, 
then  operator 

FOR  Matrix  ASSUME  X  =0. 

is  a  request  for  25  elements  of  variable  X  zeroisement.  The  method  of  this  request  realization  isn’t  fixed  in 
this  language. 

NORMA  is  a  single-assignment  language,  reassignment  is  prohibited.  That’s  further:  operators  are 
incorrect: 

FOR  Matrix  ASSUME  X  =  Y;  X  =  Z. 

FOR  Matrix  ASSUME  X  =  X+1. 

Constraints  on  the  index  form  the  left  part  of  relation  (  these  are  indexes  without  displacement)  are  not 
essential  because  domain  D  from  the  operator’s  header  allows  to  define  rather  complicated  relations.  If  you 
need  to  define  the  computation  of 

Xfj  =  F(Y„Zj)  on  domain  Ox  :  ( ( I=1..N ) ;  ( J=1..M  ) ) . 

type  you  may  do  it  in  this  way  : 

OxJI:  Ox/.T=I. 

FOR  Ox.Ti  ASSUME  X  =  F<Y,Z) , 

As  we  mentioned  earlier  functions  may  be  included  into  arithmetical  expression  and  scalar 
arithmetical  expression. 


5.2.3  Function  calls  in  Norma, 
call-function : 

call- reduction-function 


calUstandard'function 
call-extemal-function 
call- reduction-function : 
name-reduction-fiinction 
(  (  name-domain  larithm-expression  ) 
call-standard-junction : 

name-standard-function  (  arithm-expression  ) 
name-  reduction-function  : 

{  SUM, MULT, MAX, MIN  } 

call-extemal-function : 

name -external- functionf  (  Wst-in-parameter )  ] 
in-parameter : 

arithm-expression 

name- variable-on-domain  ON  domain-of -parameter 
iterated-variahle -on-domain  ON  domain-of-parameter 
name-extemal-simple-part 
name-extemal-fiinction 
domain-of-paraniercr : 

name-unconditional-domain 
name- unconditional-domain  /  index-expression 
name-unconditional-domain  /  (Wstdndex-expression) 
iterated-vanahle-on-domain : 

name- variahle-on-domain  [  name-iteration-index  [-1]  ] 

There  are  defined  standard  arithmetical  functions,  reduction  functions  and  external  user's  functions  in  the 
language. 


5.2.3. 1  Standard  arithmetical  functions. 

Functions  :  sqrt{x).  ahs(xK  espix).  alo(^(x),  alo(tlOix).  sinix).  cosix}.  (isin(x).  armUx),  cntienx)  lu'o 
standard  functions. 

5. 2. 3. 2  Reduction  functions 

Functions  :  SUM  (sum),  MULT(muitiplication).  MAXimaxiinum ).  MIN(  mini  mum,)  arc  reduction 
functions.  Call  to  these  functions  : 


name-functionf  (  name-domain )  arithm-expression). 


Domain  defines  the  number  of  domain’s  point  where  the  function  will  be  computed,  arithmetical 
expression  defines  the  number  of  values  which  the  function  are  applied  to.  Here  is  an  example  : 

'■=' . "• 

y=i 

The  fragment  of  this  computation  in  NORMA  : 

Grid:  (Oi:  (I1..N)  ;  Oj:  (J=1..M) ). 

VARIABLE  A  DEFINED  ON  Grid. 

VARIABLE  v,w  DEFINED  ON  Oi.  VARIABLE  X  DEFINED  ON  Oj. 

FOR  Oi  ASSUME  V  =  w  +  SUM(  ( oj )  a*X)  . 

Here  is  an  example  of  embeded  reduction  functions: 

SurfaceMax: 

(Left:  (Three:(IV=1..3)  ;  A:(J=0»2*M);B:(I=0..2*N)  ); 

Right:  ( ( IK=1..9)  ;  (W=0..2*M)  ;  BNU:fNU=0..2*N) ) ). 

SUMPOL:(  (W=0..2*M)  ;BNU). 

ThreeAB:(Three;A;B).  OLINE:(LINE=l.J). 

VARIABLE  vv,v  DEFINED  ON  ThreeAB  DOUBLE. 

VARIABLE  JSEQ  DEFINED  ON  SurfaceMax  DOUBLE. 

FOR  ThreeAB  ASSUME 
VV=SUM(  (SUMPOL) 

SrMr(OLINE)  vriVrrLTNEl  ^ 

(V[IV=1]  -  JSEQ[IK=LINE]  +  V[IV=2]  JSEQ[IK=LINE+31  +  V[IV=3]  ^  JSEQriK=LINE+61) )  ) . 

The  rule  of  index  localization  is  true  for  reduction  functions  :  domain  declared  as  argument  of  reduction 
function  has  its  own  index  system.  The  subject  of  this  system  action  is  arithmetical  expression  declared  as 
the  second  argument  of  reduction  function.  It  means  that  indexes  used  in  the  arithmetical  expression  may 
be  divided  into  two  types  :  "intemal”  which  coincide  with  the  indexes  of  the  reduction  domain  and  defined 
by  them  and  “external”  which  don't  coincide  with  indexes  of  reduction  domain  and  defined  by  external 
domains  (  e.g.  domains  of  other  reduction  functions  or  ASSUME  operator  ).  The  set  of  “internal”  indexes 
values  is  completely  defined  by  the  domain  set  as  an  argument  of  reduction  function.  The  set  of  “external” 
indexes  values  is  defined  by  the  domains  which  are  external  for  reduction  function. 

E.g.  operator 

FOR  Oik  ASSUME  X={i+k)  /  SLM{  (OljS)  B,c. 
defines  request  for  computation 
i  +  k 

-  i.k  =  \ . 10 

I 

if  the  declarations  are  : 

Oij55:  ((i=1..5) :  0=1-5)).  Oik:  (i=1..10) ;  (k=1..10)). 

Oijk:  (OiJ:((i=:1..15) ;  0=1-15))  ;(k=1..15)). 


VARIABLE  B  DEFINED  ON  Oijk.  VARIABLE  C  DEFINED  ON  Oij. 

VARIABLE  X  DEFINED  ON  Oik. 

If  external  domain  is  conditional  the  condition  acts  on  all  the  external  indexes  of  arithmetical  expression 
which  are  under  the  sign  of  reduction  function.  E.g.  formula 

100 


/  =  U,10,  ,10,  i<j,  };<0, 


/  =  l 

sets  computation 


100 


100 


r,<0:  ...  };.<0:  = 

/=!  /=! 

100  100 

J^3<0:  ^2.3  =  2^/.3J^3  -  Y,0<0:  X2..0  = 


/=! 


100 

/=! 

may  be  written  as  : 

Oij:  (Oi:  (i=1..10);  Oj:  (1..10))  .  OiLTj:Oij  /  i<l. 

OiLTj  YLTO,  OiLTjYGEOiOiLTj  /  Y<0. 

OilOO:  (i=1..100).  OijlOO;  (OilOO;Oj). 

V.4RIABLE  B  DEFINED  ON  Oijioo.  VARIABLE  X  DEFINED  ON  Oij. 

VARIABLE  Y  DEFINED  ON  Oj. 

FOR  OiLTJYLTO  ASSUME  X=SUM(  (  OilOO)  B,Y). 

Recursive  computations  are  prohibited  for  the  values  used  in  both  the  left  part  of  ASSUME  operator 
and  arithmetical  expression  in  reduction  function.  E.g.  formula 
10 

=  7  =  1,...,5 

/  =  1 
></ 

5.2, 3  J  User's  external  functions. 

External  function  is  defined  by  user.  One  can  call  this  function  by  name,  the  list  of  actual  parameters 
divided  by  comma  is  put  into  brackets  and  placed  after  the  name.  The  result  of  funciiiuTs  computations  is 
represented  by  function's  name.  If  all  the  actual  parameters  have  their  values  compulation  is  available  . 
All  the  actual  parameters  are  considered  initial  data  for  function  compulation,  side  effect  is  prohibited. 

Arithmetical  expressions,  part  and  function  names  or  variables  on  domain  (arrays)  can  be  actual 
parameters  of  external  function. 

Here  is  an  example  of  external  function  with  different  ways  of  actual  parameters  setting  . 


Grid:  (Oi:  (I=L.N)  ;  Oj:  (J=1..M)). 

VARIABLE  A,  B  DEFINED  ON  Grid. 

VARIABLE  X,Y  DEFINED  ON  Oi.  VARIABLE  T  DEFINED  ON  Oj. 

VARIABLE  Gamma  DOUBLE¬ 
INPUT  A,B  ON  Grid,  Y  ON  Oi,  T  ON  Oj.  INPUT  Gamma. 

OUTPUT  X  ON  Oi. 

DOMAIN  PARAMETERS  N  =  10,  M  =  20.  EXTERNAL  FUNCTION  DxDy. 

FOR  Oi  ASSUME  X  =  F(DxDy,  Gamma+0.5.Y,  T  ON  Oj,  A  ON  Grid  /  (i=2..8J=:M.2),  B  ON  Grid 
/j=I+2). 

ASSUME  operator  set  the  rule  of  array  /  =/,70  computation.  User’s  function  F  with  parameters 
values  are  to  be  computed  for  every  /.  The  list  of  parameters  : 

1- st  parameter  -  name  DxDy  of  external  user’s  function; 

2- nd  parameter-  arithmetical  expression  Gamma+0.5*Yf; 

3- rd parameter- 7^ y=:7,..., 20  values  of  static  array; 

4- th  parameter  -  Aij  values  of  array’s  section  :  one-dimensional  array  consists  of  the  2-8 

rows  elements  from  the  1 8th  column  of  A  matrix; 

5- th  parameter  -  5,  /.  /=/ . IOj-i+2  of  dynamic  array’s  section  :  7+2  column  of  B  matrix. 

The  rules  of  formal  and  actual  parameters  correspondence  : 

(1)  Tlie  number  of  actual  and  formal  parameters  must  be  equal;  correspondence  is  set  from  the  left  to  the 
right  in  writing  order. 

(2) 


Formal  parameter 

Actual  parameter 

function  name 

function  name 

part  name 

part  name 

scalar 

arithmetical  expression 

variable  on  domain 

variable  on  domain 

(3)  If  both  fomial  and  actual  parameters  arc  variables  on  domain  then  the  number  of  domain  indexes  ;md 
the  number  of  points  in  the  ranges  on  corresponding  indexes  must  be  the  same. (5. 1 .2). 

5.2.4  Part  call, 
call-pan : 

COMPUTE  name-pan  [  (actual-parameters)] 


actual-parameters : 

[list-in-parameter] 

/RESULT  lisi-out-parameter] 

out-parameter : 
name-scalar 

name-variable-on-domain  ON  domain-of-parameter 
iterated-variable-on-domain  ON  domain-of-parameter 

Arithmetical  expressions,  part  and  function  names  or  variables  on  domains  (arrays)  can  be  actual 
parameters. 

Actual  parameters  may  be  declared  as  initial  or  results  by  key  word  RESULT. 

E.g.: 

COMPUTE  Velocity  (  Delta+0.5,  Fi  ON  Oijk  RESULT  V  ON  Oij). 

The  first  two  parameters  are  initial  ,  the  third  is  result.  Side  effect  is  impossible  :  if  sets  of  initial 
parameters  and  parameters-results  intersect  the  result  of  intersection  will  be  reassignment  which  is 
prohibited  in  NORMA. 

Part  call  is  a  development  of  notion  relation  in  ASSUME  operator  because  it  allows  to  obtain  several 
results  at  once.  Part  call  without  ASSUME  operator  is  a  development  of  scalar  operator  notion. 

The  ways  of  part  initial  actual  parameters  setting  and  the  ways  of  function  actual  parameters  settings 
(5. 2. 3. 3)  are  the  same. 

Scalars,  variables  with  indexes(  may  be  set  by  the  rule  of  default  indexes),  variables  on  domains  (arrays) 
may  be  actual  parameters-results. 

If  part  call  is  in  the  body  of  ASSUME  operator  then  scalars  and  variables  on  static  domain  (  which 
don’t  change  for  different  values  of  indexes  from  ASSUME  header)  can’t  be  parameters-values  -  it  causes 
reassignment. 

If  pan  call  isn’t  in  the  body  of  ASSUME  operator  then  scalar  names,  variables  with  index-constants 
and  variables  on  static  domains  can  be  parameters-results  only. 

Here  is  an  example  of  actual  parameters  settings  in  different  ways  . 

VARIABLE  A,B  DEFINED  ON  Grid:  (Oi:  (I=l..N);Oj:  (J=1..M)  ). 

VARIABLE  X,Y  DEFINED  ON  Oi.  VARIABLE  T  DEFINED  ON  Oj. 

VARIABLE  Gamma  DOUBLE. 

INPUT  A  ON  Grid  ,Y  ON  Oi.T  ON  O.T.  INPUT  Gamma. 

OUTPUT  X  ON  Oi.  B  ON  Grid  /.I=2..12. 

DOMAIN  PARAMETERS  N  =io.  =20. 

EXTERNAL  PART  DzDy. 

FOR  Oi  ASSUME 


COMPUTE  TEST  (  DzDy,  T  ON  Oj,  Y  RESULT  X,  B  ON  Grid  /j=l+2). 

COMPUTE  SCALAR(  A  ON  Grid  RESULT  Gamma,  B  ON  Grid  /j=2). 

The  rule  i=l,...,I0  and  part  of  matrix  B  :  Bij,  i=,...,10,  j=3 . 12  computation  is  defined  in 

ASSUME  operator. 

Scalar  call  of  part  SCALAR  defines  scalar  Gamma  and  the  second  column  of  B  matrix. 

The  rules  of  actual  and  formal  parameters-results  correspondence  are  the  same  as  for  correspondence 
rules  for  functions(5.2.3): 


Formal  parameter 

Actual  parameter 

scalar 

scalar 

scalar 

variable  with  index  constant 

variable  on  domain 

variable  on  domain 

5.2.5  Interface  with  FORTRAN  programs. 


Subroutines  and  functions  written  in  Fortran  may  be  called  from  the  program  written  in  NORMA  by  usual 
tools  of  part  and  function  call.  Reassignment  control  is  carried  out  in  actual  parameters  analysis. 

For  example,  subroutine 

SUBROUTINE  SNIPPY(Y,N,X) 

REAL  X(N),  Y(N) 

DO  1  I=1,N 
X(I)  =  SIN(Y(I)) 

1  CONTINUE 
RETURN 
END 

can  be  called  as  it  is  shown  here: 

Oi  :  (  I=1..N  ). 

VARIABLE  X.Y  DEFINED  ON  Oi. 

INPUT  Y  ON  Oi. 

DOMAIN  PARAMETERS  N  =  to. 

COMPUTE  siNXYtY  ON  Oi.  N  RESULT  X  ON  Oi). 


5.2.6  Setting  of  sequential  computing  mode. 


The  order  of  carrying  out  operations  (operators)  is  often  important  in  computational  algorithms 
realization.  The  order  may  have  an  influence  on  convergence,  stability  of  solution  method  etc.  There  is  a 
possibility  of  sequential  computations  mode  setting  in  Norma.  It  allows  user  to  fix  the  order  of 
computation.  On  this  purpose  delimiters  #  ...  #  are  used.  Operators  put  in  the  delimiters  #  ..#  are  performed 
in  the  order  written  in  program.  Correctness  of  operators’  sequence  in  Norma  is  under  control  : 
reassignment,  using  of  undeclared  values  ,  etc.  are  fixed.  The  notation 
# 

X=5.0  .  Z=SIN(X+0.5)  .  Y=:COS(X)-Z*^X, 

# 

is  correct,  but  notation 
# 

X=5.0.  Y=COS(X)-Z*^X  .  Z=SIN(X+0.5)  . 

# 

is  incorrect  because  undefined  value  of  Z  \'ariable  is  used  in  the  second  operator(  if  we  take  off  delimiters 
then  both  notation  are  correct). 

5.3  Iteration 

iteration  : 

head-of-iteration 
[ boundary-value  ] 
initial-value 
body-of-iteration 
exit-condition 
end-iteration 
head-of-iteration : 

ITERATION  Wsi-iterated-element 
ON  nan le-iteration-indcx  . 
iterated-element : 

naine-vaiiablef  (list-naine-result)  / 
bnundaiy-value : 

BOUNDARY  {  operator.r  END  BOUNDARY 


inirial-valiie : 


{ INITIAL  name-iteration-index  - 
int-constant:  ( element-of-initiai}* 

END  INITIAL/^ 
element-of-initial : 
operator 

declaration-of-input 
declaration-of-output 
body-of-iteration : 

{  element-of-iteration-body,r 

element-of-iteration-body : 
operator 
iteration 

declaration-of-output 
exit- condition : 

EXIT  WHEN  log-expression 
end-iteration : 

END  ITERATION  name-iteration-index 

Computation  process  often  is  iterative  in  mathematical  physics  problems  solutions.  Such  process  may 
be  set  by  previously  described  tools  of  the  NORMA  language.  You  should  extend  domains’  declarations 
adding  extra  direction  corresponding  to  iteration  index.  But  to  use  such  an  extension  isn’t  always  correct, 
because  the  direction  is  fictitious  and  it  represents  the  way  of  computation  but  not  space-time  grid.  Besides 
the  boundaries  of  such  fictitious  direction  are  often  unknown. 

Special  construction  ITERATION  allows  possibility  to  set  iterative  computational  process  avoiding 
all  the  difficulties  mentioned  above. 

Informally  iteration  set  iterative  computations  with  iteration  index  changing  from  0  to  some  integer 
positive  value  which  is  a  condition  of  iteration’s  end. 

Let’s  take  iterative  computational  process  dealt  with  the  system  of  equations  solution: 

m 

^A,jXj=F,,  i  = 

/  =  ! 

and  set  by  formulae 

X°=X0.  f  = 

exit  condition  jjA^"  —  j|<  £ 


This  process  written  in  NORMA: 


Array:  (Oi:  (I=1..M)  ;Oj:  (J=1.,M))  . 

VARIABLE  X0,X,F  DEFINED  ON  Oi.  VARIABLE  A  DEFINED  ON  Array. 

VARIABLE  Epsilon. 

DOMAIN  PARAMETERS  M  =  100. 

INPUT  XO  ON  Oi,  A  ON  Array.  INPUT  Epsilon. 

OUTPUT  X,  Xpred  ON  Oi. 

ITERATION  X  (Xpred)  ON  N. 

INITIAL  N=o  : 

FOR  Oi  ASSUME  X  =  XO. 

END  INITIAL 

FOR  Oi  ASSUME 
X  =  l/A[j=l],(F-SUM(  (Oj/ioj)A.X[I=J,N.l]). 

EXIT  WHEN  MAX(  (Oi)  ABS(X[N]  -  X[N-1) ) )  <  Epsilon 

END  ITERATION  N. 

Iteration  itself  is  the  last  8  lines  of  example.  There  is  iteration  index  (here  it  is  N)  and  variables  taking 
part  in  computations  and  presenting  the  result  of  these  computations  (  -  X  -is  the  value  from  the  last 
iteration  step,  -  and  Xpred  —  the  value  from  the  before  the  last  iteration  step)  in  the  iteration  header.  Value 
Xpred  doesn’t  require  supplementary  definition,  it  is  considered  to  be  declared  in  the  same  way  as  X.  Values 
X  and  Xpred  can  be  used  for  computations  out  of  iteration:  in  the  given  example  these  values  are  declared 
output. 

You  can  use  the  possibility  of  iterated  variables  boundary  values  of  setting  (in  general  it  is  not 
obliged).  Boundary  values  of  iterated  variable  are  set  by  common  Norma  operators  inside  the  block 
BOUNDARY  ...  END  BOUNDARY.  These  values  are  considered  unchangeable  on  iteration  and 
defined  on  ever)'  its  step. 

Initial  values  for  iterated  variables  are  set  by  blocks  INITIAL...END  INITIAL  (there  may  be  several 
ones  ).If  the  values  of  iterated  variables  depend  only  on  the  values  of  the  previous  level  (one-level  iteration) 
then  only  one  block  is  indicated 

INITIAL  N=0:...  END  INITIAL. 

If  the  values  of  iterated  variable  depend  on  the  values  of  two  previous  levels  (two-level  iteration)  then  two 
blocks  are  indicated 

INITIAL  N=0:  ...  END  INITIAL 

INITIAL  N=.i:  ...  END  INITIAL 

etc. " 

Variables  which  are  the  result  of  multilevel  iteration  and  used  outside  iieratu)n  are  indicated  in  the  header  oi 
iteration.  E.g.  the  header 

ITERATION  X(X1,X2,X3)  ON  N. 


defines  values  X,X1^2,X3  of  iterated  value  X  obtained  at  the  last  four  steps  of  four  level  iteration.  The 
number  of  results  can’t  be  more  than  n  for  /i-level  iteration. 

Body  of  iteration  is  part  of  NORMA  program.  In  particular  you  can  define  new  iteration  in  another 
direction  inside  the  iteration.  In  the  given  example  body  of  iteration  consists  of  the  only  ASSUME 
operator. 

Iteration  index  can  be  used  in  the  list  of  indexes  indicated  for  iterated  variable.  In  fact  we  can  consider 
that  iterated  variable  has  supplementary  index  in  boundaries  of  iteration  (  on  factitious  direction  ).  Iteration 
index  without  displacement  may  be  not  indicated. 

Only  iterated  variables  can  indicate  iteration  index. 

The  values  of  variables  unindicated  in  the  list  of  iterated  variables  may  be  computed  in  the  body  of 
ITERATION  construction.  Such  variables  may  be  used  for  interim  results  of  each  level  iteration 
representation.  The  usage  of  such  variables  doesn’t  break  reassignment  prohibition  :  we  consider  the  a  new 
copy  of  the  variable  is  used  at  each  step  of  iteration. 

Iteration  process  ends  if  logical  expression  set  in  exit  condition  becomes  true. 


Appendix  L  Representation  of  initial  program. 

Initial  program  is  represented  in  initial  file  according  to  the  rules  : 

1.  Text  of  every  part  of  NORMA  program  is  written  in  unformatted  representation.  If  you  need  to  carry 
operator  or  declaration  it  is  prohibited  to  break  key  words,  identificators,  constants.  In  other  cases  you  can 
carry  the  words  as  you  like  it. 

2.  Key  words,  identificators,  constants  are  separated  by  spaces,  special  symbols,  end-line  symbol. 

Spaces  are  not  meaningful  symbols:  the  group  of  spaces  are  considered  as  one  space. 

3.  The  line  or  part  of  line  beginning  from  symbol  “!”  is  a  commentary. 

4.  Information  placed  in  the  interval  from  the  symbol  standing  at  the  first  position  up  to  the  symbol  “?” 
also  standing  at  the  first  position  are  not  translated.  It  allows  to  choose  from  initial  file  those  parts  which 
you  want  to  translate. 
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The  problems  of  loop  process  organization  for  the  program  written  in  nonprocedural  language 
NORMA  are  considered  in  this  paper.  An  algorithm  of  designing  the  system  of  simple  loops  allowing 
parallel  processing  is  given.  The  algorithm  is  based  on  the  notion  of  computation’s  front  which  is  a 
hyperplane  where  variables’  values  may  be  computed  in  every  its  point.  The  task  of  Linear  Integer 
Programming  is  solved  for  determination  of  the  hyperplane’s  parameters. 

Key  words:  nonprocedural  language,  synthesis  of  the  program,  parallel  computations. 
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1.  Introduction. 

The  NORMA  language  is  used  for  writing  numerical  solutions  of  mathematical  physics 
problems.  Program  in  NORMA  is  a  strict  notation  of  formulae  and  some  supplementary  information 
for  a  translator  to  design  computer  program. 

Program  has  no  information  about  the  order  of  computation  and  the  ways  of  loop  process 
organization.  Determination  of  computation  order  is  one  of  the  main  translator’s  tasks.  Information 
connections  set  in  the  program  are  initial  data  for  computation  order’s  determination  and  organization  of 
loop  process. 

Nonprocedural  nature  of  NORMA  allows  to  overcome  some  difficulties  in  algorithm 
parallelization. 

An  algorithm  of  loop  process  organization  for  NORMA  program  is  given  in  this  paper.  Loop 
parallel  processing  can  be  organised  by  this  algorithm. 

Principal  NORMA  operator. 

There  are  declarative  and  executive  parts  in  NORMA  program.  Variables  ‘  names  types  which 
are  to  calculate  in  the  task  s  computation  are  indicated  in  declarations.  The  rules  of  carrying  out 
computations  according  to  the  language  semantics  are  placed  in  tJie  executive  part  [2]. 

Let’s  describe  briefly  principal  language  operator  ASSUME.  The  structure  of  the  operator  is 
given  below: 

FOR  <domain  >  ASSUME  <relation>. 

Let’s  determine  informally  the  terms  “domain”  and  “relation”  .  The  domain  is  a  set  of  points 
with  integer  non  negative  coordinates  in  n-dimensional  space. 

The  form  and  the  type  of  relation  is  given  below: 

I  1  m 

A  1  •  •  •i  ^  ~  1, /77 

/ 

Here  X  are  variables  names.  Set  is  a  set  of  indexes.  Index  construction 

Af^  /c  =  1.  /77  has  the  following  structure: 

^</I<  n 

A/.  -  integer  non-nesative  constants. 

/ 

Relation  sets  the  rule  of  deriving  X  value  from  the  left  part  of  the  relation  by  the  values  of 
variables  from  the  right  part.  Any  variable  is  calculated  only  once  in  every  point  of  domain  i.e. 
reassignement  is  prohibited  in  NORMA.  Operator  ASSUME  points  out  the  necessity  of  computing  the 
variable’s  values  from  the  left  part  of  the  relation  for  all  the  points  of  the  domain  set  in  the  operator’s 
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header.  In  general  it  doesn’t  mean  that  computing  of  X  will  be  carrying  out  simultaneously  for  all 
the  points  of  the  domain  in  the  particular  place  of  the  program. 

Example.  Let  two  operators  be  set: 

1. FOR  A:  (k=L.m)  ASSUME 

X=fi(Y,J. 

2.  FOR  A  ASSUME 

Y=f2(X0 

We  consider  value  of  variable  Y  for  has  been  calculated  before  these  operators.  It  is 
evident  that  operators  order  where  all  the  values  ofX  variable  are  calculated  first  and  Y  values  are  done 
after  them  is  impossible.  Argument  is  to  be  computed  for  calculating  X  even  when  K-2  (  here 
argument  is  Y  value  for  K-\),  Y is  calculated  in  the  second  operator  i.e.  X  and  X  components  are 
computed  by  turn  in  the  given  case. 

Determination  of  the  operators’  order. 

One  of  the  problems  to  be  solved  on  the  translation  stage  is  the  problem  of  operators  (or  the 
groups  of  the  operators )  order  determination. 

To  specify  a  program  as  a  dependence  graph  is  a  traditional  way  of  program  representation  and 
we  use  well-known  approaches  and  methods  in  our  work. 

Let’s  determine  dependence  graph  G  {V,  E,X)  which  is  a  directed  graph.  V  denotes  a  set  of 
nodes  corresponding  to  the  names  taking  part  in  the  computations.  If  variable  X is  calculated  in  several 
operators  it  corresponds  to  several  nodes.  E  denotes  a  set  of  arcs  corresponding  to  relations  between 

A 

variables.  Arc  from  the  node  corresponding  to  variable  X  to  the  node  corresponding 

'  /  /  1^ 
to  variable  X  conforms  to  the  dep)endence  X  on  the  values  of  X  with  index  structure 

±  A  ^  ,  -  -  •  • ,  X  A  .X  denotes  a  set  of  arcs*  marks. 

Strongly  connected  subgraph  of  graph  G  not  being  included  into  any  other  strongly  connected 
subgraph  of  graph  G  is  called  the  Most  Strongly  Connected  Subgraph  (MSCG)  of  graph  G.  Having 
chosen  all  the  MSCG  G G„  (  e.g.  by  the  method  suggested  in  [3])  partial  arrangement  of 
G/,...,  G„  could  be  done:  if  there  is  an  arc  from  G^  to  Gj ,  then  /  >7  . 

Consider  loop  program  realization  R(Gi)  built  for  every  G^.  Then  the  program  on  the  set 
graph  G  can  be  represented  in  the  form  of  realizations’  R(G^  /=1,  H  sequence  taking  into 
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consideration  partial  arrangement  set  earlier.  Thus  let’s  consider  an  algorithm  of  designing  loop 
realization  for  the  operators  corresponding  to  the  nodes  of  MSCG. 

Loop  program-  realization  is  designed  for  particular  operators.  Further  we’ll  determine  the 
class  of  these  operators. 


1  m 

Consider  relation  system  for  computing  values  X 
11  m 

.  (1) 

m  1  m 

Xj^^  “  ^/7j(  •  •  • » 

Initial  domain  is  set  by  inequalities: 

/f=Vn  (2) 

System  (1)  is  said  to  correspond  to  some  MSCG. 

Task  may  be  formulated  in  the  following  way: 

1  m 

Design  loop  program-realization  which  provides  computation  of  variables  X,...,X  for  all  the  points 
oi'ihe  domain  (2)  if  such  computation  is  possible.  Program  realization  must  be  designed  according  to 
the  given  system  of  relations  (1). 


Front  of  computations. 

Further  we ‘11  consider  compulation  organization  only  in  the  formulae  corresponding  to 
MSCG.  Considered  model  of  computations  is  realized  in  several  stages.  The  first  stage  consists  of 
computing  the  variables  depended  on  external  data  (  i.e.  the  values  of  such  variables  is  computed 
outside  given  MSCG).  The  set  of  points  where  variable  L/is  calculated  at  the  first  stage  is  indicated  as 

Tlie  second  stage  consists  of  computing  the  variables  depended  on  both  external  data  and  the 
variables  computed  at  the  first  stage.  The  set  of  points  where  the  variable  U  is  calculated  at  the  second 
stage  is  indicated  as  .  This  process  will  be  continued  till  U  values  are  computed  in  all  the  points 

of  the  initial  domain.  Tlie  sequence  of  U  variable  computation  domains  result 

of  the  process.  Computation  of  U  variable  in  all  the  points  of  is  carried  out  parallel  but 

transition  from  the  one  domain  to  another  is  sequential.  Similar  domains  systems  are  to  be  designed 
for  all  the  variables  included  into  MSCG. 
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Complicity  of  the  sets  structures  caused  impossibility  of  writing  the  traversal  of  all 

the  points  from  domains  in  the  form  of  effectively  realized  loops’  systems. 

On  the  other  hand  the  following  representation  seemed  acceptable:  a  set  of  points  belonged  to 
the  domain  can  be  set  by  an  equation  of  hyperplane  given  below: 

4(/f):/f/;+...+pf;/„+A„=/f  (3) 

where  k  -  integers. 

We  arrange  sets  4(y)  in  the  following  way.  Let  contains  the  points  where  the 

computation  of  U  values  depends  on  the  external  data  only  and  these  points  satisfy  a  condition  given 
below: 

Set  L^{2)  consists  of  the  points  where  the  computation  of  U  values  depends  on  the  external  data  and 
on  the  values  computed  in  the  points  of  set  Ly(1)  besides  these  points  satisfy  a  condition  given 
below: 

p“/;+...+  /f„/„+A„=2 

We’]]  continue  this  process  ti]]  all  the  values  of  variable  U  in  the  initial  domain  are  computed 
Example:  Let’s  consider  relations: 


set  on  the  domain  1  <  /  <  7,  1  <  y  <  7 

The  value  in  the  f/Jjsquare  points  out  number  K  of  corresponding  set  .  E.g.  the  value  in 
(4,4)  square  of  X  variable  equal  to  7  means  that  variable  X in  the  point  (4,)  is  computed  on  the  7th  step 
of  model’s  functioning. 


1 

1 

1 

I 

1 

1 

1 

1 

2 

2 

2 

2 

2 

1 

7 

6 

4 

4 

4 

3 

1 

8 

8 

7 

6 

5 

5 

10 

9 

9 

8 

7 

6 

12 

11 

10 

10 

9 

8 

14 

13 

12 

11 

11 

10 

7 

6 

5 

4 

3 

2 

1 


Ty 


6 

5 

3 

3 

3 

1 

1 

7 

7 

5 

5 

4 

1 

1 

9 

8 

7 

6 

6 

1 

1 

11 

10 

9 

8 

7 

1 

1 

13 

12 

11 

10 

9 

T” 

14 

13 

13 

12 

11 

1 

1 

1 

1 

1 

~r 

T” 

7 


Difficulties  in  the  designing  of  loops’  system  for  effective  traversal  of  5*  set  points  are  evident. 

Let’s  create  the  system  of  sets  belonging  to  the  plane  that  is  defined  by  the  equations: 

-i+2j+4=k 

y/f):  -/+2y+5=/c 

The  tables  where  the  values  in  the  squares  determine  belonging  of  a  particular  point  to  the  set  are  given 
below: 


4(/f) 
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Differed  from  sets  S  *  sets  L{k)  have  regular  structure.  It  gives  a  chance  to  write  the  sequence  of 
computations  in  the  form  of  not  complicated  loop  structure. 

Given  tables  show  us  that  at  each  step  of  computation  there  is  a  set  of  points  where  variables 
values  are  computed  simultaneously(parallel). 

E.g.  the  values  of  variable  Y  in  the  points  (1.7)  (3,6)  (5,5)  (7,4)  and  the  values  of  variable  X 
in  the  points  (2,6),  (4.5),  (6,7)  are  computed  parallel  at  step  K=S. 

All  the  points  where  variable  X is  computed  parallel  lie  on  the  one  line. 

The  values  of  Tin  the  points  (2,6)  (4.5)  (6,4)  and  the  values  of  X  in  the  points  (1.6)  (3.5) 
(5.4)  (7.3)  are  computed  at  the  next  step.  All  the  points  where  variable  Y(X)  is  computed  at  this  step 
also  lie  on  the  one  line. 

Let's  assume  that  there  is  such  a  hyperplane  where  we  can  orginize  computations  in  the  way 
described  above.  Introduce  a  definition. 

Definition.  Front  of  computation  of  variable  U  at  the  step  K  is  a- set  of  points  {/, . /J 

where  the  value  of  u  may  be  computed  parallel  and  the  points  belong  to  hyperplanc 

U^):4^+--+P“/n  +  A„=/f 


Ly{l^ 
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Changing  K  values  we  obtain  the  family  of  hyperplanes  which  covers  all  the  points  of  the 
initial  domain. 

We  can  show  that  fronts  of  computations  included  into  MSCG  and  satisfy  condition 
4  >0  /C  =  1,  n  are  parallel  hyperplanes. 

Consider  the  relation  included  into  MSCG 

point  ( 4 . ..  4)  G  ),  point  C 

the  value  of  V  must  be  computed  before  U  value  is  available,  so 

fl/>...+p,/;+A,  =  /Cg>/c,  =  p^(/;  +  ^)+...+p,(4^ 

The  main  inequality  is  derived  from  the  one  given  above.  All  the  parameters  of  hyperplane  must 
satisfy  it 

n 

(5) 

/f=1 

Consider  computations’  fronts  relations  for  variables  U  and  V  included  into  MSCG. 

Uk)'J,Rii  +  ^u=l< 

I 

L,{k):YF\i,  +  A,  =  l< 

I 

Lets  consider  some  point  with  coordinates  (^,.,,4)  belonged  to  the  initial  domain.  The 
computation  of  U  value  in  this  point  is  carried  out  at  step  and  the  computation  of  V  value  is 
done  at  K^sitp,  i.e. 

I 

I 

Subtracting  the  second  equation  from  the  first  one  we  obtain: 

A,-A,  =  K,-K, 

I.e.  difference  A^  — shows  us  “delay’  (“outstripping")  of  L/variable  in  comparison  with 
V  variable. 

Now  we  can  devide  the  question  of  loop  process  organization  into  two  questions. 

1 .  Definition  of  computation’s  front  hyperplane’s  parameters. 

2.  Definition  of  coordinates  of  the  points  lying  on  the  front’s  hyperplane  (  i.e.  determination 
of  the  parameters  of  the  loops  providing  traversal  of  all  the  hyperplane’s  points). 


Statement  of  Linear  Integer  Programming  problem. 
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Let  the  system  of  relations  (1)  be  set.  Parameters  (p},.. are  to  determine  for  variables 

1  m  / 

X,...,  X.  Besides  values  of  A^.  are  to  be  defined  for  every  variable  X.  System  of  inequality  (5) 
defines  the  conditions  for  p, Al  /  =  1,  A77  to  satisfy. 

Computations  in  the  points  lying  on  the  one  hyperplane  are  done  parallel.  But  transition  from  the  one 
plane  to  another  is  sequential.  Thus  the  number  of  the  planes  covering  all  the  points  of  the  initial 
domain  determines  ( number  of  a  sequential  steps  of  computations. 

The  formula  for  Kp  is  given  below. 

where  N^M  j  are  the  upper  and  the  low  boundaries  of  changing  i  coordinate  domain. 

Thus  the  task  of  wave’s  front  determination  comes  to  solution  of  LIP  problem  of  the  type 
given  below. 

Find  which  satisfy 

/=1 

with  following  constraints 

A,-A^ >0  1,1,  =Un 

/=1 

/ 

where  5]^^  is  a  displacement  to  index  /  in  the  relation  defined  dependence  of  X  on  X  during  the 
computation. 

Nowadays  direct  algorithm  of  integer  programming  is  used  for  solving  such  a  task  [4]. 

Note  1.  Further  we'll  consider  only  those  MSCG  where  formulated  task  of  LIP  has  a  solution. 
We  don’t  consider  the  question  of  computation  organization  in  the  case  of  sequential 
computing.  E.g.  for  the  relations  of  the  following  type 

with  2^i  ^  M  2  ^  J  ^  N 

there  is  no  front  of  computations  ( as  it  is  defined  above).  It  is  derived  from  the  fact  that  inequalities  of 
(5)  type  for  the  given  example  has  the  following  form 

p,  -pj  >0 
— >  0 
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A^+/?2>0 

We  can  see  from  the  given  inequalities  (  the  first  and  the  last  inequalities  are  conflicting)  that 
there  is  no  values  Px^Pi  defining  the  front  of  computations  for  variables  A  and  B  but  the 

computations  for  the  given  relations  may  be  organized  in  the  form  of  the  simple  sequential  loops. 

Note  2.  Presence  of  absolute  values  in  the  function  demands  solving  of  several  tasks  and 
choosing  of  the  optimal  one.  We  have  to  face  the  real  tasks  with  the  dimension  of  no  more  than  3  and 
it  allows  to  judge  of  practical  applications  of  this  approach. 


Program  scheme  for  the  one  operator. 

Let’s  assume  that  variable’s  values  computed  in  the  left  part  are  not  used  in  the  right  part  i.e. 
there  is  no  recurrent  dependence.  It  means  that  all  the  values  of  the  used  variables  are  known  and  the 
computation  can  be  done  parallel  for  all  the  points  of  the  domain.  The  scheme  of  operators’  loop  has 
the  following  form: 

DO  LABEL  ix=Mx,Nx 


DO  LABEL 


LABEL 


Now  assume  that  there  is  a  recurrent  dependence  in  the  operator.  In  general  the  task  of  finding  the 
parameters  of  wave’s  front  is  to  be  solved  first  and  then  you  may  start  designing  the  loops’  scheme. 
The  exception  might  be  such  an  index  (  or  the  group  of  indexes  )  which  displacements  has  the  one 
sight.  If  such  an  index  exists  we  won't  consider  it.  The  header  of  the  loop  corresponding  to  particular 
index  has  the  form: 

DO  LABEL  iirMk,Nk 
in  the  case  of  the  negative  displacements 

DO  LABEL  ik=^MioNkA 


in  the  case  of  the  positive  displacements. 

Let’s  create  a  new  graph  G/7  derived  from  the  initial  graph  G  by  deleting  the  arcs  where 

corresponding  mark  is  strictly  positive.  Then  use  procedure  ot  tinding  index  with  displacements  ot  one 
sign.  Tf  there  is  no  one  like  that  then  for  the  rest  dependencies  on  other  indexes  you'll  have  to  solve 
the  task  of  determination  of  computations*  front  parameters. 

Hyperplane  of  computations’  front  is  said  to  be  defined.  Tlie  scheme  ot  the  program  in  this 
case  will  have  the  form: 

DO  LABEL  K=Kb,Ke 

DO  LABEL  CONC  ALL 
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LABEL  <  operator,  variable  computation> 

where  L(k)  is  an  equation  of  hyperplane.  Record  CONC  ALL  e  L{k)  means  that  the 

loop  provides  processing  of  all  the  points  lying  on  the  hyperplane  L(k)  and  the  computation  in  this 

points  may  be  earned  out  parallel.  External  loop  is  sequential  and  it  provides  transition  from  the  points 
of  the  one  hyperplane  to  another 

Front  of  computations  has  the  form: 

Pxh+-+PX  =  k 

( In  the  case  of  one  relation  A  =  0 ) 

Changing  K  values  we  obtain  the  system  of  hyperplanes  which  covers  all  the  points  lying  in 

the  initial  domain.  It  is  evident  that  minimal  and  maximal  values  of  K.  variable  are  on  the  domain’s 
boundaries  and  they  are  equal  to: 

K,  =  Y.P^N,^Y,PM>  (6) 

Pi<0  p,>Q 

K  =  ^PMi  +  Y.PiN,  (7) 

p,  <0  pi  >0 

Now  let  s  consider  the  question  of  designing  loops’  system  providing  processing  of  all  the  points  of 
hyperplane.  First  let’s  assume  that  all  i  =  \m  and  the  greatest  common  devisor  of  the 

numbers pi  are  equal  to  1. 

Indicate  f,  =  (8) 

Initial  equation  assumes  the  form 

Pi>i+ti=k-A  (9) 

Solution  of  this  equation  depends  on  the  parameter  tf,  ( integer )  and  has  the  form: 
i^  = 

t,  =  {k-A)-pid  (10) 

It  is  natural  that  you  need  to  choose  only  those  solutions  which  belong  to  the  initial  domain.  E.g. 
value  i ^  must  satisfy  inequalities 

i.c.  A/,  <  c/j  <  .  Besides  let’s  define  the  boundaries  of  changing  ^’alue  of  /j . 

Indicate 

n  n 

Cx  =  (11) 

/  =  2  /=2 

Pi>0  /7j<0 
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Cn  =  +  %PiM.  (12) 


Pi  <0  /7,  >0 

It  is  evident  that  ^  ^  .  Substituting  instead  of  f ,  its  expression  into  the  last  inequalities 

and  solving  it  on  d,  we  obtain 


fO-fl,  /0_.l 

*1  ^min  >>  ^  ^  *1  *max 


if  Pi  >  0 


r°-/'  P  — r' 

^  max  >  ^  >  >  min  *1^  <  0 


where  /f*  =  A:  —  A 

In  the  result  the  domain  of  changing  value  of  is  set  by  inequalities 

>4 


/O  _  t' 

where  c/f=  max(  M„  ^ - 22L) 


t°-tL 


c/;'=i  min(A'.-J - =.)  (13) 

L  Pi  J 


for  Pi  >  0 


d\=  max(M„-J - =-) 


<=  min(iV,,-^ - 22L)  (14) 


for 


Pi  <0 


Consider  the  rest  part  of  the  formula 

yW;+...+p„/„  =tl 

where  =  {k  —  A)  —  p\Cl^  =  f °  —  P,d, 


Indicate 


4=ft4+...+p,/; 


Equation  ( 1 5)  assume  the  form 

A4+^2  =  ^2 


Its  solution  has  the  form 


4  =  4 
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for  >  0 .  In  the  case  of  p^<0  \ht  formulae  are  derived  in  the  same  way. 

We’ll  continue  this  process  till  the  initial  equation  assumes  the  form 
Pn-^ir^^+  d^) 

Let’s  consider  that  and  p^  are  reciprocals.  Then  at  first  using  Euclid’s  algorithm,  we  solve  the 
equation 

P^lLl+Pn/«  =  1  08) 

Let  . be  the  solution  of  the  equation  ( 1 8).  Then  it  will  be  evident  that 

/n=^''^ll+CLr’'Pn-1 

are  the  solution  of  equation  ( 1 7).  From  the  conditions 

Mn<in<N, 

we  obtain  the  range  of  changing  value 

C(ti  > 
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where  = 


Pn  Pn-^ 


0^.1= 


Pn  P/v-l  . 


in  the  case  of  Pp>0 


where 


t  =  /°*  /° 

‘/3  'n  ‘n-1 


(19) 


(20) 


The  rest  expressions  for  ,  values  with  the  different  combinations  of  the  signs  p^ 

values  are  derived  in  the  same  way.  Thus  the  scheme  of  the  loop  provided  processing  of  all  the  [points 
of  hyperplane  has  the  following  form: 


t,°  =  k-A 


DO  LABEL  Di=Df,D/ 


DO  LABEL  D2=D^,DI 


DO  LABEL 


n  =  D*  D® 

—  ^n-V  ^n- 


'n-V^n-^ 


LABEL 


Note;  Assume  that  as  some  index  exists  then  parameter  of  hyperplane  p^  =0.  Hence  the  equation  has 
the  form: 

P/>-+A-i4-i+  A^iLi+-”+Pr/n  +  A  =  A'  (21) 

In  this  case  we  consider  algorithm  of  loop  designing  for  the  equation  as  it  is  given  above(21).  The 
header  of  the  loop  corresponding  to  the  direction  will  have  a  form: 

DO  LABEL  =  A/;^ 
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Loop  on  the  direction  can  be  formulated  as  both  external  and  embedded. 

Structure  of  index  expressions. 

The  order  of  the  embedded  loops  for  processing  of  all  the  points  of  hyperplane  are  deteimined 
above.  Each  direction  of  the  space  corresponds  to  some  loop  index.  For  the  first  n-2  directions 
correspondence  has  the  form: 

ik  =  ^i< 

For  the  last  two  indexes  correspondence  has  more  complicated  form: 

ir^^=ta+dn-^*Pn 
in=tp  +  4>-^*Pn-^ 

Le.  index  structure  of  the  relation  ( /^, . ,  i^)  corresponds  to  the  following  structure  in  the  program: 

(of, ... .  ctz,  t,  +  p„, tp - 

The  numbers  Pr^v  Pn  must  be  reciprocals  for  the  solution  of  equation(18).  Let  them  be  non 

reciprocals.  Then  before  start  an  algorithm  we  must  find  two  reciprocals  among  n  numbers  of 

Such  numbers  must  be  found  by  all  means  because  at  the  beginning  we  assume  that  the 

greatest  common  devisor  of  the  numbers  JC),. . jCJ,  are  1.  This  constraint  doesn't  change  an  algorithm. 

In  this  case  only  the  correspondence  of  the  space  directions  and  loop  indexes  is  changed.  The 
correspondence  for  this  case  is  given  below. 

/;  =  c( 

(-1  =  4-1 

=  4 


^s-1  ~  4-2 


4=  cti 


4=^/J  +  4,-1*Pr 


Front  of  computations  with  complicated  parameters. 
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Previous  computations  were  carried  out  having  assumed  that  the  greatest  common  devisor 
(5)  differs  from  1.  Let’s  consider  changes  in  the  algorithm  in  the  case  5  1 . 

Let  the  initial  front  has  the  form: 

6{p{i,+...+  pJ„)+A=k 

Represent  A  in  the  form  ofA  =  ^  +  r,  0<  r<  5 

Let’s  show  that  there  are  the  points  on  the  hyperplane  for  every  value  of  K  .  Let  the  point 
with  coordinates  0  lying  on  the  hyperplane  L^{ko)  be  at  the  step  k^.  Assume  that  the  point 

on  the  hyperplane  L,  (Aq  +  f)  exists  at  the  step  k^+t,  t  <S .  Then 
6{p^+...+  p„0  +  A  =  k 
5(iq/7+...+  p„0+A=/(o+f 

are  the  equations  where  the  following  formula  is  derived  from 

=  j  (23) 

There  is  an  integer  value  in  the  equation  from  the  left  but  there  is  none  from  the  right.  Hence  the 
statement  of  the  point  existence  on  the  hyperplane  L^(  Aq+  f)  isn’t  right.  Further  more  from  the  last 
equation  we  can  see  that  the  points  for  the  initial  equation  of  the  hyperplane  lie  on  the  planes  which 
are  5  value  apart,  i.e.  if  there  are  the  points  on  the  plane  L^iko) ,  then  the  new  points  lie  on  the 

planes  L^{ko+ IS)  where  /=±1,  ±2,... 

Proposition.  The  sequence  of  the  computations  for  the  front 

mk):  fl/;+...+  p„/;+A=/c  (24) 

is  equivalent  to  the  sequence  of  the  computations  for  the  initial  front(22). 

Proof.  First  let  ‘s  show  that  at  the  initial  value  K  =  for  the  initial  front  the 

computations  are  carried  out  in  the  same  points  and  for  the  front  in  the  form  (24)  for  K  —  where 

p>0  p  <0 

and  A  is  taken  from  the  representation  A  =  ^  +  T 

From  the  representation  and  there  -h  T  is  derived 

Consider  point  P„)  6  •  Let’s  show  that  this  point  belongs  to  the  front  of  (24) 

type  for  K  =  i.e.  . /^)  €  ,i.e. 

5(pi?+...+p„/°)+A  =  K’;  (25) 

We  must  show  that 

R^+--+Pni°n+’^=f<l 
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Substitute  expression  K]j  into  (25) 

5()q/f+...+p„0  +  A  =  K'2*5  +  r 

Taking  into  consideration  that  A  =  8K  +  T  we  have 

5{  p/?+.. .+ p,/° +A)  +  r=K^*5 

i.e.  p{^+...+pf,+A=Kl 

We  have  shown  that  the  points  lying  on  the  front’s  plane  exist  not  for  every  K  for  the  initial  equation. 
If  the  first  are  on  plane  L^{K^)  then  the  other  lie  on  plane  L^{Ktj  +3*  n)  >  Let’s  show  that  for  any 
point  lying  on  this  plane  there  exists  the  plane  of  front  set  in  the  form  (24)  (Lz{Kl  +  rn) 

Let  P„)emKl  +  5*m) 

Let’s  show  that  given  point 

6{  .+  p,/°)  +  A*6  +  r=Kl+6m 

We  must  show  ~  K^-^-  m 

We  have  (5  +  r 

Substituting  the  expression  for  we  obtain 

p^+...+pf,+l=K^+m 

It  is  derived  from  the  proposition  1  that  in  the  case  5^1  there  is  enough  to  transit  from  the  front  of 
(22)  type  to  the  front  of  (24)  type  and  then  to  use  the  algorithm  of  loop  parameters  definition  given 
above. 


Loop  structure  for  some  relations  in  the  case  of  the  simple  front. 

/ 

Let  the  front  for  X  variables  included  into  MSCG  has  the  form 
Lj {k):  pi,+...+pJ,  +  Aj  =  k  j  =  lTm 

Tlie  dimension  of  all  the  variables  is  said  to  be  ;?  and  p,.. p^  are  reciprocals.  As  appears  from  the 

above  if  the  dimensions  of  the  domains  and  A  values  are  the  same  for  the  variables  then  loop  headers 
are  the  same  too.  Thus  in  this  case  the  formulae  setting  computations  for  such  variables  may  be  written 
together  in  the  body  of  designed  loops'  system.  If  this  condition  isn't  performed  then  first  we  define 
K.  and  Kg  as 

Kf,  =  min ,  AT,  =  max  j  =  Lm 

/  *'  / 
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Further  we  must  put  the  test  of  the  following  type  before  every  structure 

K,<K<K, 

and  only  when  the  condition  is  performed  we’ll  start  the  loop  providing  search  of  the  points  on  the 
front’s  plane  with  current  K  value  for  the  given  variable. 

Note.  Though  if  there  are  still  variables  set  on  the  one  domain  and  had  the  same  A  value 
they  may  be  united. 

Thus  the  scheme  of  loop  in  this  case  has  the  following  form: 

DO  LABEL  K=K^,Kg 

IF  K>K>aK<KI 

DO  LABELj  CONG  ALL  e  y  {K) 

kiv-Jn)= . 

h 

LABEL]  x(/;,...,/;)= . 

IF  k>k;,aK<k^, 

DO  LABELr  CONG  ALL  i„)  e  L^{K) 

kk>-Jn)= . 


LABELr  i„) 


LABEL  GONTINUE 

h  h 

where  values  of  variables  are  common  for  variables  X,... ,X  and  /C^,  K^-  for 

variables  X,... ,X. 

Consider  the  case  when  p,., aren’t  reciprocals.  Remember  MSCG  consists  of  the 
several  operators. 

Let  the  computations’  front  of  U  variable  has  the  form 
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U  ky.  5{  . .+  p„i,) +AjS+r,=  k 

and  for  V 

/.,{ A):  5{pi,+. . .+  p„/„)  +A^+r^=k 

It  is  evident  that  computation  of  U  and  V  are  carried  out  for  the  same  K  values  in  the  case  when 


r,=  r,. 

It  is  not  difficult  to  show  that  if  r^>  then  computation  of  V  is  carried  out  steps  later  after 

computation  of  U  variable  (  though  it  can  be  done  in  the  diiferent  points  of  the  domain).  While 
transition  to  the  front  of  computations  with  simple  parameters  the  computations  of  all  the  variables 
are  carried  out  for  the  same  k  values.  In  this  case  the  scheme  of  the  program  must  be  changed. 

All  the  relations  are  united  in  the  groups. 

The  relations  included  in  the  one  group  are  carried  out  parallel  as  it  is  done  before.  Though 
every  group  begins  processing  only  after  the  operators  of  the  previous  group  end  it. 

Let’s  formulate  the  rules  of  uniting  the  relations  into  the  one  group  and  the  order  of  the  group 
movement.  Uniting  of  the  relations  is  done  in  the  following  way.  Let’s  represent  variable  Ay  in  the 

form  of  Sy*(5  -f  Ty.  All  the  relations  where  Ty  values  are  equal  are  united  in  the  one  group.  Define 
the  order  of  the  groups’  movement  Consider  the  initial  front  of  the  wave 

(5(  fl/i  +. . .+  pj^)  +  A  =  /c)  and  determine  the  minimal  value  of  for  every  X  variable.  Let’s 

arrange  the  groups  according  to  the  following  rule.  The  first  will  be  the  group  which  includes  the 
relation  where  assume  the  value  (  on  all  the  values  of  Given  group  unites  all  the 

relations  where  =  The  next  group  unites  the  relations  where  r=rQ+1.  If  there  is  no  such 
group  then  choose  the  group  with  r=  -h  2  etc.  Note  /^  <  5  .  If  we  define  the  place  of  the  group 
with  r  =  5  —  1  then  the  next  group  will  be  with  r  =  0 ,  etc. 

Note.  Let’s  consider  the  question  if  the  formulae  included  into  the  different  groups  (/J  fy) 

have  the  same  minimal  value  K,,,  i.e. 

/  /■ 

Let  the  wave's  fronts  for  X  and  X  has  the  form: 


L,(A):<5(fl/;+...+p„/„  + 
Lj{k):5{pi,+...+pJ„+ 


Assume  that 


A,)  +  /;  =  A 
Aj)  +  r^  =  k 


n-r, 


)+.. .+  p„{i'„  -  // ) + A,  -  A  y  = 

Because  ^  Ty  and  |/J  —  /^|  <  5  our  assumption  isn’t  right. 
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The  scheme  of  the  program  is  given  below. 

/C,=... 

DO  1  K^K^,K, 

Computations  on  the  relations  included  into  the  1st  group 
Computations  on  the  relations  included  into  the  2nd  group 

Computations  on  the  relations  included  into  the  5—1  group 
1  CONTINUE 

Transition  from  the  computations  of  the  one  group  to  another  is  sequential.  Computations  of 
the  relations  included  into  the  one  group  are  organized  in  the  way  similar  to  the  case  of  the  front  with 
simple  parameters  i.e.  parallel  inside  each  relation  and  besides  each  relation  separately. 


DO  1  K=Ki,,Kg 

DB=MAXX(3,(K-N)/2) 

DE=MINN(M,(K-3)I2) 

DO  2  D=DB,DE 
A{D,K-2*D)  =  B{D,K-2<-D-r1j 
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2  B{D,K-2^D)  =  A{D-Z  B(D,/C--2*D--1) 

1  CONTINUE 

Note.  Function  MAXX(X,Y)  gives  us  the  nearest  integer  exceeded  the  greatest  of  the 
numbers X and  7  as  a  result  Function  MINN(XyY)  gives  us  the  nearest  integer  which  doesn’t 
exceed  the  least  of  the  numbers  X  and  7. 

Embedded  loop  may  be  carried  out  for  all  D  values  .  This  loop  corresponds  to  the  computation 
of  the  values  A  and  B  on  particular  line  (2*  /+  y  =  K)  with  fixed  K  value  Transition  of  the  lines 
(loop  on  K)  is  sequential.  Common  number  of  the  steps  in  the  loop  is  2M+N‘S. 

If  we  don’t  solve  the  LIP  task  we  may  first  analyse  index  displacements  on  each  direction. 
Note  that  displacements  on  index  i  has  one  sign.  Hence  we  may  not  take  into  consideration  link 
between  the  variables  on  this  index  in  those  cases  when  the  displacement  is  non-zero.  We  can  see  the 
result :  variable  B  doesn’t  depend  on  A.  We  can  propose  the  following  scheme  of  the  loop. 

DO  1  /=3,M 
DO  2  j=3,N 
A(iJ)=B(i,j-l) 

1  B(  i,j)=A(  i-2,j+2);B(  i,j- 1 ) 

Program  loop  seems  easier  in  this  case.  First  you  mustn’t  compute  the  boundaries  of  changing  indexes 
of  the  embedded  loops  (as  you  must  in  the  case  of  the  first  solution).  Second  the  structure  of  the  index 
expressions  is  simple  enough.  Though  loop  designed  in  this  way  is  sequential  (  both  on  i  and  y).  You 
may  do  parallel  only  computations  of  A  and  B  when  i  and  j  are  fixed.  Common  number  of  sequential 
steps  is  ( M  —  2)*  ( A/  —  2)  in  this  case. 

Example  2.  Let  the  relations  be  set 
j  ~  +1,  y-1  *  ^+1 ,  /+1 »  ^/+1 ,  j  ) 

^/,  j  ~~  y-1  >  ^+1,  j  ’  ,  y -1 ) 

j  “  j-V  ^'~l,y »  y-1 ) 

the  domain  of  changing  indexes 

y=2 . 

The  fronts  of  computations  for  X,  7,Z  are  indicated  Lj^{l<),  Ly{k),  L^{k)  correspondingly.  LIP 
problem  solution  has  the  form: 

R  =  -Z  R>=6,  A^  =  5,  A^=Q  A^  =  3 

The  values  of  pi  and  p2  are  not  reciprocals.  The  fronts  of  computations  has  the  form: 

4(/c):  -i+3j+2  =  k  r^=1 
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Ly{k):  -i+3j=k  r^  =  0 

4(/c):  -/+3y+1  =  /c  r^  =  1 

It  follows  from  given  above  that  computations  of  values  X  and  Z  are  carried  out  parallel  (  because  X 
and  Z  are  included  in  the  one  group  and  Y  in  another  one)  and  sequentially  after  the  computation  of  Y. 

The  scheme  of  the  program  is  given  below. 

K,  =  -M+6 

K,  =  -2+3*N 

DO  LABEL  K=Kt„Kg 

DB=MAXX((2+K)l3a) 

DE=MINN((M+K)I3,2 ) 

DO  LABELl  DY=DB^,DE^. 

LABELi  Y{3^DY-K,DY)  =  X{3*  DY  -  K,DY-^)■,  Y{3*  DY-K  +1,  DV); 

Z{3^DY-K-\DY-1) 

DB=MAXX(K/3,2) 

DE=MINN((M+K-2)/3,N) 

DO  LABEL2  DX-DB^DE^ 

LABEL2  X(3* DX-K  +  ZDX)  =  X(3h  DX-K+3  DX  -1);  ^(3*  DX  - /<  +  3 DX  +1); 

Zi3^DX-K+3DX) 

DB=MAXX((K+\)I3,2) 

DE=MINN((M+K-\ )/3,N) 

DO  LABELS  DZ=DB..DE. 

LABELS  Z(3^ DZ-K+X DZ)  =  X(3^ DZ  -  K, DZ  -1);  Y{3* DZ -  K,  DZ)\ 

Z{3t=DZ-X+1,DZ-1) 

L.4BEL  CONTINUE 

E\  cr>'  turn  of  the  loop  on  K  consists  of  two  sequential  steps.  At  the  first  step  the  values  of  variable  Y 
in  the  front’s  point  -i+3j=k  are  computed  parallel,  at  the  second  step  the  values  of  variable  X  in  the 
front's  points  i-+3j+2=k  and  the  values  of  variable  Z  in  the  front’s  po'mls-i+3j+ 1 =k.  The  order  of 
computations  corresponding  components  of  matrixes  X,  Y,  Z  in  the  case  M'=l .  N—1,.  E.G. 

X(5.5  )=27  means  that  the  value  of  variable  X  in  the  point  (5.5)  is  computed  at  27  step  (the  first 
computation  is  carried  out  at  step  1). 

X  Y  Z 
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Introduction. 

NORMA  is  a  programming  language  [1]  aimed  at  automation  of  mathematical  physics 
problems  solutions  on  parallel  computer  systems. 

The  NORMA  language  allows  elimination  of  a  programming  phase  in  transition  from 
formulae  specified  by  a  technical  expert  to  a  program  itself.  There  is  no  much  difference  between 
formulae  and  NORMA  specifications.  In  fact  these  formulae  are  input  data  for  a  translator. 

Synthesis  of  output  program  is  carried  out  automatically  during  the  translation  from  NORMA. 
The  order  and  the  way  of  performing  calculations  (parallel,  vector  or  sequential)  is  determined 
automatically.  The  order  of  the  language’s  sentences  is  arbitrary  (  information  dependencies  are  revealed 
and  taken  into  account  during  the  organization  of  computing  process  ).  There  are  no  such  programming 
terms  as  memory,  loop,  control  operators  in  the  language.  Output  program  is  generated  with  the 
architecture  of  a  target  computer  as  a  guide. 

In  fact  the  program  in  NORMA  is  a  nonprocedural  specification  of  the  problem  to  be  solved. 
The  synthesis  of  output  program  raises  some  mathematical  problems  but  they  are  solvable  in  the  case 
of  NORMA  language. 

Some  Norma  peculiarities  makes  the  process  of  automatic  object  program  design  available  for 
practice  realization.  They  are: 

1.  Index  expressions  of  calculated  variables  has  the  form  i±C 
where  /  -  index  name,  C  -  integer  constant. 

NORMA  is  a  language  with  single  assignment.  Any  value  can  be  assigned  to  a  variable  only 
once  (  only  once  to  each  point  of  domain  -  to  the  variables  defined  on  domain  ).The  first  constraint 
defines  the  class  of  formulae  which  can  be  used  for  the  problem’s  solution  .  It  isn’t  strict  in  practice 
as  the  index  expressions  of  other  type  are  very  rare. 

Memory  allocation  and  the  problems  of  its  economy  caused  the  second  constraint.  These 
problems  can  be  solved  at  the  translation  stage.  The  second  constraint  simplifies  the  problem  of  output 
program  synthesis. 

The  problem  of  output  program  is  to  be  solved  during  the  translation.  Solving  this  problem  is 
based  on  the  analysis  of  the  graph  of  information  dependencies.  The  Most  Strongly  Connected 
subGraphs  (MSCG)  are  chosen  from  the  graph.  In  general  case  organization  of  computations  for  the 
nodes  requires  use  of  special  methods. 

Assume  is  a  principal  operator  in  NORMA.  This  operator  sets  the  relations  between  variables 
being  calculated  on  a  domain.  Researching  on  the  subject  of  computational  process  organization  has 
been  doing  for  a  long  time.  The  notion  of  local  relation  on  an  array  was  introduced  in  [5]  and  called 
parameter  record.  Synthesis  of  loop  programs  for  parameter  record  in  the  case  of  one-dimension  array 
was  investigated  in  [6].  The  problems  of  existence  and  degree  of  parallelism  of  computational  process 
on  multidimensional  arrays  are  considered  in  [7].  Organization  of  loop  computations  for  relations  set  on 
half  limited  domains  is  described  in  [8].  The  problems  of  parallel  program  synthesis  on  the  base  of 
special  formalization  of  application  domain  are  investigated  in  [9].  ' 
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The  purpose  of  this  paper  is  to  specify  the  method  of  designing  loop  operators  which  realizes 
the  relations  included  into  MSCG.  Given  work  is  a  continuation  of  [2]. 

1  FRONT  OF  COMPUTATIONS. 

Executive  part  in  the  NORMA  language  may  be  represented  in  the  following  form: 

1  1  m 

Xl  ^ 

.  (1) 

/77  1  m 

Here  /  ={ /^,..., /^}  denotes  the  array  of  indexes  corresponded  to  calculated  variables 
1  m 

X, ,  X,  Dj  j  =  {  di  jj , .. ,  fl^y}  denotes  the  array  of  displacements  to  index  variables,  C^j  -  integer 

constants  in  NORMA  as  it  is  defined  in  Norma. 

Note.  The  use  of  one  and  the  same  value  with  different  index  expressions  is  possible  in  the 
right  part  of  the  relations.  Besides  the  independence  from  some  variables  is  also  acceptable.  The  form  of 
relation’s  system  is  used  for  simplification. 

The  computations  defined  by  t  relation  are  said  to  be  performed  in  the  domain: 

M[<i^<N\,  /f=lTn  (2) 

The  values  required  for  computations  and  lain  outside  the  domain  are  considered  known.  The  order  of 
computations  isn't  set  in  an  explicit  way  thus  information  dependencies  are  to  be  analysed  and  then 
computations  are  arranged  on  the  result  of  analysis. 

Directed  graph  is  constructed  in  accordance  with  the  given  system  of  relations.  Every  relation 
corresponds  to  the  graph’s  node.  Information  dependence  of  the  following  type: 

k  t 

Xf  =  f;f(...,X/+0«,...) 

t  k 

corresponds  to  the  graph’s  arc  from  node  X  to  node  X  with  mark  . 

The  most  strongly  connected  subgraphs  are  searched  in  the  obtained  graph  and  the  graph  is 
reduced.  A  new  graph  is  a  non-loop  and  thus  the  procedure  of  arranging  nodes  according  to  the  relation 
of  ‘'computed  before”  order  is  possible.  Determination  of  the  loop  process  for  the  operators  not  included 
into  MSCG  is  easy. 

The  problem  of  loop  process  determination  for  the  operators  included  into  MSCG  will  be 
considered  below. 

At  first  we  consider  the  methods  of  computations  sequence  informally. 

Definition  1.  Set  of  the  points  satisfied  equation 

t 

A(X, /f)  =  ,1„}:  /q^+...+p^„+A' =/f,  (3) 

t 

are  called  basically  front  and  equation  (3)  -  basical  equation  of  X  variable  at  k  step  computations’  front. 
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Note.  Difference  A^,  f,  /  =  1,  A77  defines  several  steps ,  that  computations  of  variable  X 

I 

“delay”  (outstrip”)  from  the  computations  of  variable  X  in  the  same  point. 

Note  that  the  range  of  changing  step  k  value  is  defined  by  all  means  for  the  initial  domain  (2). 

Let  the  range  of  step  k  changing  has  the  form:  .  Computation  of  the  variable’s  values 

consists  of  the  sequence  of  steps  in  the  initial  domain.  At  first  value  k  is  assigned  to  variable  k  : 

t 

k  =  and  the  values  of  variable  X  are  computed  in  all  the  points  of  basical  front  (3).  Then  start 

t 

with  the  next  step.  1  is  added  to  k  value  (k=  /:+!)  and  the  value  of  X  is  calculated  on  a  new  front. 

Given  process  is  continued  until  k  value  assumes  its  maximum 

/ 

Note.  Every  variable  X  (when  value  of  (^)  step  is  fixed)  is  computed  on  the  plane  generally 

t 

parallel  to  the  plane  where  the  value  of  X  is  computed. 

Definition  2.  Basical  equation  where  the  Greatest  Common  Devisor  (GCD)  =  1  is 

called  reduced  basical  equation. 

Let’s  consider  the  following  system  of  equations: 

A2{X,k2):R>M-^R>.nin+^'2  = 

A,(X,/c,):p,y;+...+p,„/„+A',  = 

Here  p^j,  7  =  H  S=1,  f  f  =  1,  are  integer  numbers. 

The  first  equation  from  system  (4)  is  basical. 

t 

Definition  3.  Front  of  computations  of  X  value  at  /cj  step  is  a  set  of  the  points  satisfied  basical 
equation,  the  order  of  the  points’  traversal  is  specified  by  the  equations(4). 

t 

The  order  of  variables’  computations  is  considered  below.  The  values  of  variable  X  in  the 
opines  with  integer  coordinates  belonged  to  the  plane  set  by  the  basical  equation  of  the  front  are  to  be 

calculated  when  /(^(from  range  [ is  fixed.  The  order  of  other  points’  traversal  is  specified 

•by  other  equations.  The  second  equation  defines  the  family  of  A7-dimensional  hyperplanes  (  according  to 

/C2  values  changing  from  /c^'^to /c^^^).  By  adding  the  first  equation  we  obtain  the  family  of  W-1- 

dimensional  hyperplanes  which  cover  W-dimensional  hyperplane  set  by  the  basical  equation.  Thus  the 

points  on  the  n- 1 -dimensional  hyperplane  are  calculated  first  (when  /Cg  =  ).  Then  1  is  added  to  /Cg 

value  and  the  values  in  the  points  of  the  next  -dimensional  plane  are  calculated  etc.  Let’s  consider 


K 

^  (4) 

K 
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fixed  «-l -dimensional  hyperplane  ( defined  by  the  first  two  equations  when  k^  =  lc^  and  /fg  =  ^2  )•  'The 
order  of  the  traversal  of  the  points  belonged  to  this  hyperplane  is  determined  in  the  same  way:  the  third 
equation  is  considered  and  varying  values  from  to  we  obtain  the  family  of  hyperplanes 
covering  all  the  points  of  n- 1 -dimensional  hyperplane.  The  third  equation  with  the  equations  defined 
n-1-  dimensional  hyperplane  sets  n-2  -dimensional  hyperplane  (when  /^  =  Zfg  is  fixed).If  there  are  no 

other  equations  then  the  order  of  the  traversal  of  the  points  on  this  hyperplane  is  arbitrary  (i.e.  parallel). 
In  the  other  case  the  next  equation  is  considered  etc.  This  procedure  will  be  continue  until  the  last 
equation  is  considered.  It  is  evident  that  if  r=n  then  the  traversal  of  the  points  of  the  initial  domain 
will  be  sequential  i.e.  point  by  point.  If  r<n  the  computation  may  be  carried  out  parallel  in  the  points 
n-  r-t-1 -dimensional  hyperplane.  In  particular  if  the  front  of  computation  consists  of  the  one  (basical) 
equation  then  the  computations  of  the  given  variable  may  be  carried  out  independently  in  all  the  points 
of  the  plane. 

Note.  The  questions  connected  with  the  determination  of  the  front  of  computations  are 
described  in  [3-4]. 

2  Computations  on  the  basical  front 

Let’s  consider  the  problem  of  the  order’s  determination  of  the  initial  domain’s  points’  traversal  when 
the  front  of  computation  is  set  by  the  basical  equation: 

/ 

AiX,k)  =  4  +  A,  =  k  (5) 

Denote  O’,  =  GCZ)(/7,,...,/7„) .  Thus  every  coefficient  of  the  equation  p^,  i=\,n  is  expressed 

in  the  following  way:  =  O’,*  .  Represent  equation  (5)  in  the  form: 

O’,  *{p^iy^...+pj„)+  A',  =  k  (6) 

Denote  =  fj .  Then  equation  (6)  assumes  the  form: 

CT/,  = 

Write  the  solution  of  this  equation: 

t,  =  d[ 

(7) 

k‘  =  A',  +  0,cf,' 

Parameter  d[  is  used  for  a  sequential  examination  of  all  the  planes  covering  the  initial  domain.  As  the 
parameter  has  been  obtained  for  evety  variable  then  let’s  use  the  one  which  is  common  for  all  the 
variables.  From  k'  =  k',  (/,f  =  1,/w)  follows: 


d[ 


d^'  =rr^{dl} 

I,m 


Let 
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Denote  Rj/^  = 


A*'  -  A', 


and  represent  in  the  form 


<  =[<,]+ wu 

where  ]  -  integer  part  of  a  number  and  }  -  its  fractional  part. 

Express  all  d[  (1  =  l^m)  through  : 

=  -[R},^]  (8) 

Parameter  d[  can  assume  only  integer  values.  Sequence  of  k  step’s  initial  value  must  be  kept 

/ 

in  the  transition  to  the  common  parameter.  Thus  variable  r/  is  to  be  linked  with  every  X : 

rl={4,) 

Now  one  and  the  same  parameter  d^^^  is  used  for  the  sequential  examination  of  all  the  planes.  When 
d^^'  is  fixed  the  computation  of  variables  are  arranged  on  increasing  of  r/  values. 

Define  the  range  of  values  changing  for  every  X : 

<c  = 

r,>o  r;<o 

=  Sj-X  +  Zkw'+i^,] 

p;  >0  ;3r  <0 


Common  range  of  d^'  parameter’s  changing  is  a  segment  where 

d^^  =  ,  d^  =  max 

/=l,m  ’  /=l,m  ’ 

The  traversal  of  the  initial  domain’s  points  is  carried  out  by  the  sequential  examination  of  the  planes. 
Transition  from  the  examination  of  the  one  plane  to  another  is  performed  in  the  loop  changing  values 

of  loop  parameter  d!^^  from  (ij^'^to  d^^  .  Range  )  is  common  for  all  the  variables.  As 

/ 

every  variable  X  has  its  own  range  of  computation  steps  ^df^)  the  test  of  the  type  given 
below  is  placed  before  the  computation  of  the  variable  at  d^^  step: 

IF  e(c/, THEN... 


Let’s  consider  the  question  connected  with  the  determination  of  the  points  with  integer 
coordinates  on  the  plane  set  by  the  following  equation: 

Pxh +-+A.4  +  [4,  ]  =  c  10) 

Definition  4.  Further  the  transition  from  the  basical  equation  (5)  to  the  reduced  basical  equation 
(10)  will  be  called  algorithm  of  reduction. 

Represent  equation  (10)  in  the  following  form: 
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cr, (p,V,  =^*‘  -[K,  ]  (1 1) 

where  .  Denote  case  the  equation  has  the 

following  solution; 

I.  =  (?'-«, 

where  and  are  the  solution  of  equation. 

Note.  Choice  of  the  sign  is  arbitrary  in  this  case.  The  domain  of  changing  of  the  values  of 
^2  variable  is  determined  by  the  following  range:  t2  €  )  where 

pI  >0  p'l  <0 

r  =  Sp>;+ 

p]>0  pj 

Taking  into  consideration  the  domain  of  ^2  variable  changing  and  the  boundaries  of  index  variable 
changing  the  range  of  parameter  changing  can  be  defined. 

Then  we  consider  the  following  equation: 

=  (^/j  '  —  —  Pn^2 

Denote  the  right  part  by  7] .  Let  <73  =  GCIXp\ .  ••• ,  aU  )•  Represent  equation  ( 12)  in  the  form: 

.+^L2/„_2)  +  pUn^x  =  ^  > 

and  denoting  /3  =  +. solution  of  the  last  equation: 

^3  ~  “  Pn-\^3 

4-1  ~  ^4-1  "*"^3^3 

Variables  and  are  the  solution  of  the  equation  ^3^3  +  aUi4-i  “  ^  range  of 

parameter  changing  is  determined  in  the  same  way  as  for  parameter  d^ . 

Then  the  process  of  solving  will  continue  till  the  following  equation  is  obtained: 


pr'i,+pr\^L^2 

Its  solution  has  the  form: 

<•,  =  T^2(±pr-d. 

h  =  T,.,l’+pT^d, 

Thus  the  traversal’s  scheme  of  the  initial  domain’s  points  (2)  when  the  from  of  computation 
(5)  is  set  has  the  following  form: 
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dT"  =... 


DO  1  d,=dr,d^^ 


d^^  =... 
d^  =... 

DO  1  d^=d^'\d^^ 


=... 

jinax  _ 

••• 

DO  1  d„=d^'\dr 


c 


JF  d,e  (J,7,c/,7)&...&  e  «",<r)  ™EN 


The  expressions  of  the  index  variables  are  omitted. 


X(...)=... 


IF  d,  €  (47,c?,7)& .  c/„  €  (J,7,r^,7  )  THEN 


1  CONTINUE 

Note  I.  Sequence  of  the  formulae  is  determined  based  on  the  parameters  values. 

Note  2.  The  case  when  the  values  of  some  coefficient  =  0  corresponds  to  a  loop  along  the 
direction  4  initial  value  M and  the  final  value  N,. 

3  Computations  on  the  front  of  common  type 

An  algorithm  of  loop  parameters’  determination  for  the  fronts  of  computations  set  by  the  system  of 
equations  (4)  is  considered.  At  first  we  prove  a  theorem  .  We’ll  need  its  result  in  the  further  work. 

Theorem  1.  There  exists  matrix  with  integer  elements  (  dimension 

nx  n).  The  first  row  of  the  matrix  and  its  d6tA^( fl,.. P,)  =  1  consist  of  the  coefficients  of  the 


basical  equation. 

Proof.  The  proof  will  be  performed  by  induction.  For  n—2  matrix  has  the  form: 
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(r 


where  OC  and  j5  are  the  solution  of  pof  —  =  1  equation. 

Let’s  assume  that  everything  is  right  for  IK  k .  Prove  it  for  /7=  /C .  Represent  the  resulting 
matrix  in  the  form  of  two  matrixes’  multiplication:  where  matrix  has  the  form: 


''a 

0 

...  0 

Pn^ 

0 

1 

...  0 

0 

B„  = 

i 

•  • 

* 

0 

0 

...  1 

0 

0 

...  0 

p) 

where  a  =  GCD{p^,...p^_^)  and  (X  ^  P  are  the  solution  of  equation: 
a|3-p^  =  ^ 


Note  that  det  =  1 . 

Matrix  C„  has  the  following  form; 

( 

0 

I  0  ...  0  lj 


where  Oj  =•— ,  /  =1,  /7— 1 . 

a 

Matrix  exists  by  the  induction’  assumption. 

detC„  =  detA^i((Pi . <i£>^i)=1 

Hence 

det  A„  =  detB„  =  detC„  =  1 

Matrix  A„  is  integer  as  matrixes  B^  and  C„  are  integer.  Let’s  consider  the  first  line  of  the  matrix 
A„(  p, .. . ,  Pn) .  The  elements  of  the  first  line  are  calculated  on  the  formula: 

k 

^,n  ““  Pn  ’ 

Now  let’s  start  considering  the  method  of  loop  parameters’  determination. 

Front  of  computations  for  X  /  =  1,  AT?  variables  is  considered. 
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P^M--  +  Rjn+K  =  ^1 


(13) 


Pr,M-->+Pjn  +  K  =  K, 

Denote  (T,  =  NOD{  . . .  fl  „) .  Using  the  algorithm  of  reduction  in  2  we  transit  to  the  following 
equations: 

Piyi+-.+  R.n4+[/?J  = 


(14) 


PrM-’-^Pjn+^r  =  Kr 


Jn^ 

Jnj 

^  P  ■ 

where  and  ]j{  I  =  — ^ .  Perform  the  following  coordinate’s  transformation: 

^  '  /T 


^  n(  Pl,l » ••• »  Pl,n) 


Matrix  is  chosen  according  to  theorem  1,  where  the  first  equation  from  (14)  is  chosen  according  to 
theorem  1  where  the  first  equation  from  (14)  is  a  reduced  basical  equation  and  the  first  row  in  matrix 
A^.  Denote: 

P^.n)  “  (  Pf,n)^n  /C  —  2,  T 

We’ll  consider  the  following  system  of  equations  instead  of  system  of  equations  (13): 

.  (15) 


pf;24"+-.+4!n4"  +  4!i(^'^ -[^J)+A',  =  K, 

where  we  use  the  fact  that  i"  =  y,  =  0}*'  -[/?);,]•  Using  algorithm  of  reduction  we 

transit  from  the  system  of  equations  (15)  to  the  system  of  equations: 


K. 


(16) 


where  the  expression  for  has  the  form: 

aJ-4  +  4(K]-K.,] 


Thus  we  have  the  system  of  equations  similar  to  (14)  differed  only  in  smaller  number  of  equations  and 
index  variables.  Further  we’ll  continue  this  process  till  we  get  the  equation  of  the  following  type: 
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p,r"/r*'+...+irr/r“ + K]=  <•< « 

sigtf  K»)'«"  +  [R/;.]=  4'  '■=  n 

The  methods  of  derivation  of  index  expressions  for  such  an  equation  is  given  in  2.  We  obtain 
the  values  for  indexes:  (  for  the  case  r<  H)  as  a  result  of  the  last  equation’s 

solution.  Now  we  carry  out  the  following  sequence  of  back  transformations  for  the  initial  indexes’ 
derivation: 


[irv 


[irv 


Further  we  cany  out  the  following  transformation  using  the  fact  that  ^ 


^/n-r+4^ 

/^r+4  I 
\  n  ) 

“  A>-r+3 

1  "  y 

Thus  process  will  be  continued  till  the  values  for  the  initial  indexes  of  the  task  are  obtained: 


P'l 

II 

; 

4  The  fronts  of  computations  in  3-dimensional  domain 

To  illustrate  method  of  loop  parameters  determination  given  above  the  fronts  for  the  variables 
defined  on  3-dimensional  domain  are  considered.  The  front  of  computations  set  by  two  equations  are 
considered  in  the  first  part .  The  front  of  computations  is  set  by  three  equations  in  the  second  part. 


4.1  Sequential-parallel  computations 

Let’s  assume  that  the  front  for  some  variable  is  set  by  two  following  equations: 

ftl/i  +  /^24+a34+^2  = 

Basical  equation  defines  the  points  of  the  plane  where  requested  value  of  the  variable  is  to  be 
calculated.  The  second  equation  specifies  the  order  of  the  points’  traversal  on  the  plane  set  by  the  first 
equation. 

Informally  it  means  the  following  things.  The  traversal  of  the  points  with  integer  coordinates 
belonged  to  the  plane  set  by  the  first  equation  is  to  be  organized.  The  order  of  the  traversal  is 
following.  As  the  values  of  the  points’  coordinate  must  satisfy  both  equations  (17)  (that  is  the  setting 
of  the  line  in  the  space  if  /Cg  and  /C^  values  are  fixed)  then  really  we  need  to  organise  the  traversal  of 
the  points  with  integer  coordinates  along  the  set  line.  The  computations  of  variables’  values  on  such  a 
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line  may  be  carried  out  parallel.  Our  purpose  is  to  find  such  expressions  for  indexes’  values  (/,,  4*  4) 
(which  are  the  expressions  of  some  parameters  dj,  4,  ^2  variable  increases  when 

the  values  of  parameters  Cl[,  C^,  increase.  Then  increasing  the  value  of  Kg  variable  we  start  the 
traversal  of  the  points  along  the  next  line  etc. 

Denote  (T|  =  GCD{P\  i? Pi  25/^13)  •  Using  algorithm  of  reduction  transit  from  the  system 
(17)  to  the  following  system  of  equations: 

Ai'i"'' A24  +  ?l.34+[^i]  =  ^1 
PiJz  ^2  “  ^2 

Denote:  a  =  GCZXAj  »A,2)  •  assume  that  7^  ^  =  a^^and  jC|2  =  a^a-  Note  that 
GCD{(p^,(p2)  =  ^  and  GCZ)(  a,  3)  =  1 .  Denote  by  (Tandy  the  solutions  of  the  equation; 
(p■^C>  —  (Pz/  and  by  and  Ct  solutions  of  equation; 

aj8-A3a=1- 

Consider  the  following  matrix: 

^a(p,  a(p2 

A3  =  7  (T  0 

^acp,  a(p2  P  j 

Transform  the  initial  coordinates: 

fi\ 

'1  '1 

As)  4  ~  ^ 

J3) 

and  denote  ( jcf  3)  =  ( Ai >  Fiz’  Ph)’ ^3'  ■  second  equation  assumes  the  form: 

fiA+fi3i!+ ^2= K2 

As  ^  -  K]  then  the  second  equation  from  (18)  assumes  the  form: 

(^zi  Tizi!  +  7^3^)  -  /^2  =  -  -  K  ]  -  ^'2  (20) 

where  (7-,  =  GCD{p^ .  Rewrite  equation  (20)  in  the  form: 

0-2^  -  /<2  =  -  4(  ]  -  A'2 

where  f,  =  7^2^2  Ti,3^  •  solution  of  this  equation  has  the  form: 

4  =  4 

Kz  =  4^(4''-[^IK,]+^2+<^z4 


(21) 
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Then  all  the  parameters  will  be  reduced  to  the  common  parameter  and  we  start  solving 
problem: 

Write  the  solution  of  this  equation: 

Note.  The  choice  of  the  sign  before  parameter  is  arbitrary.  It  corresponds  to  the  fact  that  the 
traversal  of  the  points  is  done  by  the  sequential  examination  of  the  lines  but  the  traversal  along  eveiy 
line  is  done  parallel  on  every  point. 

The  explanations  of  some  notations’  usage  is  given  below.  In  this  case  ^  and  ^  are  the 
solutions  of  Euclid’s  equation:  has  the  form: 

The  final  solution  is  evident: 

r  cf-p,]  ^ 

3  •  2  Sequential  computations 

Let  the  front  of  some  variables  be  set  by  three  following  equations: 

fly;  +  fl.24+fl,34+^  = 

Fk^i^  +  R>2i2+  fi.3i3+^2  =  f<2  (23) 

ftl/;  +  A24+a34+A'3  =  f<3 

The  front  of  computations  set  in  this  form  determines  strict  sequential  order  of  computation  of 
the  variables  on  the  plane  set  by  the  basical  equation.  The  first  two  equations  set  the  family  of  the  lines 
(when  varies  from  ^‘"to  ^  previous  case.  At  first  the  values  of  the 

variables  in  the  points  of  the  first  line  (when  then  increasing  the  value  of  K2  by  1 

we  start  the  traversal  of  the  points  on  the  next  line  etc.  The  traversal  of  the  points  along  every  line  was 
arbitrary  in  the  previous  case.  When  the  front  of  computations  is  set  by  three  equations  the  last  one 
determines  the  direction  of  the  traversal  of  the  points  along  the  lines. 

As  it  was  in  the  previous  case  instead  of  system  of  equations  (23)  we  start  considering  the 


following  ones: 
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(24) 


P\.\h  +  Pj2+A3h+[\]  = 

P2,ih  Pl.jh  Pz.ih  "^^2  “  ^ 

P3,/l  ■*■^^3,24  ■*■  A,3^3  ■*■^3  ~  ^3 

Using  the  transformation  of  the  coordinates  (19)  we  obtain  instead  of  the  second  and  the  third  equations 
the  following  ones: 

pIJi  +  pI  +  p\i  W*'  -  [^/i, } + 4  =  -^2 

A, 2*2  Px^h  _^,J'*'^3  “  ^3 

Use  the  algorithm  of  reduction  and  obtain  the  following  system  of  equations: 

Then  using  the  transformation  of  the  coordinates: 


(25) 


(26) 


P2:2  A.3 

<33  > 

where  (3  and  (X  are  the  solutions  of  equation  P2j2^  ^  Pi.sP  denoting 

(Pz  i^pIs)^  (pI  i^pIs)'  -^2  ^  obtain  the  second  equation  from  (26)  in  the  following  form: 

pyi + Avt  +Ki(? '  -  ) + ^4 = -^3 

As  Zt  ^  ^2  ***  we  may  rewrite  the  last  equation  in  the  following  form: 

A,3*3  ~  ^3  ~  ~PlA^i  '  ~  ["^*1 }  ~  Pi. 2  (^1  ^  J  ~  ^^3 

The  solution  of  this  equation  has  the  form: 

/J  =  sign{pJj)</j' 

Reduced  to  common  parameter  we  obtain  the  solution  for  ij  in  the  form: 

ii  =  sign{p3^3}(J3*’ 

Now  we  start  the  procedure  of  the  initial  indexes'  derivation.  At  first  we  transform: 

f  '  ^ 


=  a;' 


p'l 

4 

11 

> 

^  JL 

N 

, - , 

and  then: 
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THE  EXAMPLE  OF  LOOP  OPERATORS’  CONSTRUCTION 

We  consider  the  example  of  loop  operator’s  construction  for  the  following  system  of  relations: 

^ij,k  yL  (^J,/r+I ) 

Kjj.  = 

Set  values  are  to  be  calculated  in  the  domain  set  by  the  system  of  inequalities: 

/  6(1,10);  y€(l,10);  A:  6  (1,10). 

For  X,Y,Z,U  and  V  variables  the  following  equations  of  the  computations’  fronts  are 
obtained: 


X: 

2/  +  1  = 

j  = 

^2 

Y\ 

2/ 

-2A:  +  1  = 

-j  =  K, 

Z: 

2/ 

-Ik  = 

j  =  K, 

U: 

2/  +  1  = 

—k  = 

^2 

V: 

2/ +2  = 

k  = 

^2 

Usin 

g  the  algorithm  of  reduction  we  obtain  the 

following  systems  of  computations’  fronts. 

X: 

;  = 

j  = 

^2 

Y: 

/  = 

-2k +  \  = 

1 

It 

Z: 

i  = 

-2k  = 

J  =  K, 

U: 

/  = 

K,, 

-k  = 

^2 

V: 

Z>1  = 

a:,, 

k  = 

^2 

Here  the  values  of  parameters  r/ ,  I  -  X,Y,  Z,U ,V  has  the  following  form: 
r''^=l,  r,>'=0,  r,"  =  0,  r/' =  1,  r/' =0. 

It  is  easy  to  prove  that  the  range  of  parameter  d^  changing  is  (1,11).  Besides  the  range  for 
every  value  (except  V  value)  is  (1,10)  and  for  V  -  (2,1 1). 
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DO  1  = 

IF  c?,€(U0)  IF  c/,e(2,ll) 

F~^  0 

c  - 

C  XU 

C  Y,Z,V, 

c  =  l 

IF  c?,e(U0)  IF  c/,€(l,10) 

S  El 

1  CONTINUE 

Determination  of  loop  operator’s  parameters  for  X^U  and  V  values  isn’t  difficult.  The  fronts 
of  computations  of  this  variables  are  different  thus  they  are  considered  separately.  The  procedure  of 
reduction  to  the  common  parameter  isn’t  necessary.  Thus  we’ll  consider  one  of  these  variables. 

The  front  of  computations  for  variable  X  has  the  following  form: 

/  =  d, 
j  =  K, 

Matrix  of  transformation  in  this  case  is  a  unit  one.  Change  the  system  of  notations: 

A3 


fO 

(T^ 

j 

= 

J 

k  . 

k 

\n.j 

K  J 

Corresponds  to  (20)  the  second  equation  assumes  the  form: 

j  =  d,. 

The  form  of  the  index  expressions  follows  from  (22): 

I  =  d, 
k  =  ±d^ 


Transiting  to  the  initial  notations: 


j 

=  a;'- 

dr 

We  obtain  the  solution  : 

J=d,. 
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7  =  d, 

k  =  ±d^ 

Thus  the  scheme  of  X  variable  computation  has  the  following  form: 

DO  2  =1,10 

DO  2  t/3  =  U0 
X{d„d^,d,)=... 

2  CONTINIJE 

Nested  loop  (on  d^ )  may  be  carried  out  parallel.  It  is  evident  because  before  parameter  d^ 
stands  sign 

Consider  the  fronts  of  computations  for  Y  and  Z  variables: 

7:  /  =  K„  -2k  +  \  =  -j  =  K, 

Z:  i  =  A,,  -2k  =  K^,  j  =  K^ 

As  the  front  of  computations  is  set  by  three  equations  then  both  these  variables  are  calculated  strictly 
sequential.  Unit  matrix  of  transformation  and  equations  (25)  has  the  following  form: 

7:  -2jfc  +  l  =  -J=K^ 

Z:  -2k  =  K„  J=K, 

Using  the  algorithm  of  reduction  we  obtain  the  following  fronts  of  computations: 

7:  -k=d.  -J=K, 

Z:  -k=d^  j=K, 

Parameter  ^  has  the  following  values: 

/f  =1,  if  =0. 

Matrix  of  transformation  for  obtained  fronts  of  computations  has  the  following  form: 


Transform  the  coordinates: 


^0  -1^ 

(A 

J  0/ 

UJ 

One  equation  for  the  front  of  computations  of  Y  variable  is  left  as  a  result: 

-l<=K, 

and  one  equation  for  the  front  of  computation  of  Z  value: 

Then  we  obtain  the  solution  in  the  initial  notations.  At  first  we  carry  out  the  transformation  of 


the  following  type: 
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fo 

1-1  oJt-a^J 


y 


And  finally  we  obtain: 


i 

=  A3 

-q5 

The  solution  for  Z  variable  is  done  in  the  same  way. 

The  scheme  of  the  program  is  given  below. 

DO  1  C](=1,11 

IF  a[€(1,1Q  THEN 
C  Z 

c  z 

C  Y 

DO  2  4  =-1Q1 
DO  3  =1,10 

3  Z(a{,a^,-G^)=  fe(y(cf.c^,-c4+1),Z(of,q^-1,-c^)) 
c  Y 

DO  4  =-1Q-1 

y(c(,-4,-G^)=  /y(X(c(-i,-c^,-c4),y(c(,-4+i,-4),u(Q[-i,-4,-c^),z(Qi,-c^,-4)) 

4  CONTINUE 
2  CONTINUE 
C  V 

IF  4  €  (2,1  1)  THEN 
DO  5  4  =  1>1  0 

c  4 

DO  5  4  =  1,1  0 

5  V(4- 1,4,4)=  ^u(f^(a{-1.c^+1.4).'^(a{-1.4.4-1)) 


c  X 

IF  4  S  (1,10  then 
DO  6  4  =1,10 


19 


DO  6  Cj  =1,10 

6  X(c{.c^,£4)= 

c  U 

IF  0(6  (1,1  Q  THEN 
DO  7  4  =-1Q-1 

DO  7  4  =-1Q-1 

a(4,-4,-4)=  /'u(V'(4,-4,-4+i),cy(4,-4,-4+i),\/(4-i-4-i-4)) 

7  CONTINUE 
1  CONTINUE 
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